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ARRAYS OF NUCLEIC ACID PROBES ON BIOLOGICAL CHIPS 

5 Cros5-Ref erence t:o Related Application 

This application is a continuation-in-part of USSN 
08/284,064, filed August 2, 1994, which is a continuation-in- 
part of USSN 08/1*43,312, filed October 26, 1993, each of which 
is incorporated by reference in its entirety for all purposes. 
10. Research leading to the invention was. funded in part by NIH 

grant No. 1R01HG00813-01 , and the government may have certain 
rights to the invention. 

Background of the Invention 
15 Field of the Invention 

The present invention provides arrays of oligonucleotide 
probes immobilized in microf abricated patterns on silica. chips 
for analyzing molecular interactions of biological interest. 
The invention therefore relates to diverse fields impacted by 
20 the nature of molecular interaction, including chemistry, 
biology, medicine, and medical diagnostics. 

Description of Related Art 

" Oligonucleotide probes have long- been used to detect - 

25 complementary nucleic acid sequences in a nucleic acid of 

interest (the "target" nucleic acid).. In some assay . formats , 
the oligonucleotide probe is tethered, i .e., by ' covalent 
attachment, to a solid support, and arrays of oligonucleotide 
probes immobilized on solid supports have been used to detect 

30 specific nucleic acid sequences in a target nucleic acid. 
See, e.g., PCT patent publication Nos. WO 89/10977' and 
89/11548. Others have proposed the use of large numbers of 
oligonucleotide probes to provide the complete nucleic acid 
sequence of '.a target nucleic' acid but failed to provide'an 

35.. enabling method for using arrays of . immobilized probes for 
this purpose. See U.S. Patent' Nos; 5,202,231 and 5,002,867 
and PCT'-patent publication No/*v;o 93 /17126 . . - • 



8NSDOCID: <WO 9511995A1_L> 



^V^?5/li99^.^ J : ....„...:„.../„^^_ J: ...]^OTS94a2305^ 

- ' 2 

'The development ' of VLSIPS™' technol9gy .has provided .' 
methods for .making :very large . arrays .of oligonucleotide^ probes 
in very small arrays. See. U.S. Patent No. 5,143,854 and PCT 
patent publication Nos . WO .9 0/ 1507 0 and 92/10092 , reach of 
which is incorporated herein by reference. U. S Patent 
application Serial 'No. .08.2 , 937 filed June-. 25 , 1993, describes 
methods . for making arrays • of oligonucleotide probes that can 
be used to provide the complete sequence of a target nucleic 
acid and to detect the presence of a. nucleic acid containing a 
specific nucleotide sequence. . 

Microf abricated arrays of large numbers of,, 
oligonucleotide probes, called "DNA chips" offer great promise 
for a wide variety of applications." , New ; methods and, reagents 
are required to .realize, this promise, .and the present 
invention helps meet that . need. : - - - ^ ^ 

: ;^ . ;~ .SUMMARY GF THE INVENTION ... ^. , . ^ , 

The invention provides several strategies employing 
immobilized arrays of probes for comparing a reference - 
sequence of known sequence with a target sequence showing 
substantial .similarity /With the -reference sequence, but : 
^differing -in the presence of , e .g., mutations In a .-first 
embodimentv the invention provides a tiling strategy employing 
an array of immobilized oligonucleotide probes comprising at 
least two. sets. of probes. ..A first probe, set comprises -a 
plurality of probes, -each probe comprising a segment of at 
leas-t three nucleotides exactly complementary to/a subsequence 
of the reference sequence, the segment including at least one ^ 
interrogation position complementary to a corresponding 
nucleotide in the reference sequence.- A second probe set 
comprises cbrresponding -probe for- each, ptbbe ih- the first 
probe set, the corresponding probe in the second probe set 
being identical, to.; a sequence comprising .,the cprresponding 
probe f rom ' the; first, probe set or a subsequence of at least 
three nucleotides thereof that includes the at leasts one. 
interrogation position, except that the at .least one . \ 
interrogation position>,is occupied, by^ a. different, nucleotide 
in- each- pf the ,two" corresponding, probes, from the vf irst and 
second, probe sets. The probes .in the. first probe set' have at 
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least /two interrogation positions ^corresponding to. two 
contiguous nucleotides in the - reference sequence. One 
interrogation position corresponds to one of the contiguous 
nucleotides, and the other interrogation position to the 
other - 

In a second embodiment, the invention provides - a tiling 
strategy employing an array comprising four probesets. A 
first-probe set comprises a plurality of probes, each probe 
comprising a segment of at least three .nucleotides exactly 
complementary to a subsequence of the reference sequence, the . 
segment including at least . one * interrogation position 
complementary to a corresponding nucleotide in the reference 
segueince. Second/ third and fourth probe sets each comprise a 
corresponding -probe for each probe in the first probe set. 
The probes in the second, -third and fourth probe sets are 
identical to a sequence comprising the corresponding probe 
from -the first probe set or a subsequence of at least three 
nucleotides thereof -that includes *the af least- one - 
inter-rogation position, except that the at least one • 
interrogation "position -is occupied by- a* different • nucleotide • 
in each 'of ^the-four corresponding 'probes from the' four probe 
sets.. -The first probe 'set often has at least 100 - 
interrogation positions corresponding to 100 contiguous ' 
nucleotides in the reference sequence. Sometimes the first 
probe set has an interrogation position corresponding to' every 
nucleotide in the reference sequence / The segment of ' 
-complementarity within the probe set- is usually about 9-21 
nucleotides. * Although probes may contain leading- or trailing 
sequences in addition to the 9-21 sequences, many probes 
consist exclusively of ' ,a 9-21 segment of complementarity . 
. ,;. - In a third .embodiment, the invention provides - immobilized 
■arrays of probes tiled, for multiple- reference: sequences'. • One 
such array comprises at least' one pair of- first'- and second - 
probe- groups, each group comprising- first and second sets of 
probes as defined in the first embodiment . ^Each probe dn the' 
•first probe-set- from the first ' group is- -exact ly-' complementary 
to -a. subsequence: of-- a. first reference- sequence, - and' each^- probe 
in' the first probe set from the second group is- exactly • 
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complementary to a subsequence of -a, second reference :sequence . 
Thus, the first .group. of probes are. tiled with respect, to a 
first., reference sequence^ sind the second^ group of. probes, with 
respect to- a second reference sequence. Each group , of probes 
can. also include third, and fourth sets of probes as defined in 
the second, embodiment;. , In some arrays ; of this type ,, the 
second. reference sequence is a mutated, form of -the. first 
. reference., sequence. ^ _ , , . " 

In , a . f ourth embodiment, the invention provides .arrays for 
block tiling. -Block tiling is a species of the . general . tiling 
strategies described above. - -.The usual^ unit of. a, block .tiling 
array is a group of probes comprising. a wildtype probe,, a 
first set of three mutant probes and a -second set -of j three 
mutant probes. The , wildtype probe comprises a segment, of at 
least threes nucleotides exactly complementary to ^-a subsequence 
of.a reference sequence-,. The. segment has at least first and 
second interrogation positions corresponding to first and 
second nucleotides in the reference sequence. . The probes in 
the first set of three mutant probes are each identical to a 
sequence_cpmprising the -wildtype probe or . a .subsequence .of at.:, 
least three /nucleotides thereof including the first . and- second 
interrogation positions, except in the first interrogation 
position, . which is occupied by a different nucleotide in each 
of the three mutant probes and the wildtype probe . , ,The probes 
in the second., set. of three mutant probes are each identical to 
a^ sequence^ comprising, .the .wildtype probes .or a subsequence of 
at least -three nucleotides, thereof including the first ..and 
second . interrogation, positions , except in the second 
interrogation position, , which is occupied by a different; 
nucleotide; in each of the three mutant probes and the wildtype 

probe... . c . " ■ . ■ ' \ , ■ . .. . - J. : ^ - 

In. a. fifth embodiment, ^the invention proyides . methods of 
comparing .a. target sequence with a- reference sequence. using 
arrays . of .immobilized pooled probes . The, arrays; employed in 
these methods represent a further species of the, general, 
tiling .arrays noted. above. ^ In these methods variants . of a 
ref erence . sequence^. dif f ering_. f rom the ref erence , sequence, in at 
least one nucleotide are identified and each is assigned a 
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■ designatdon An- array of pooled probes is provided; with each 
pool- occupying a- separate cell- of the array. Each pool 
comprises a probe- comprising a/ segment exactly complementary 
to- each variant sequence assigned a particular designation. 
•The array is then contacted with a target sequence comprising 
a variant of the reference sequence. The relative • - - 
hybridization ' intensities of the- pools in the- array to the 
target sequence are determined. The identity of the target 
sequence- is deduced from the pattern of hybridization 
intensities. Often, - each variant is assigned a designation ' ' 
having at least one - digit and at- least one value for the 
-digit. ' In:. this case, each pool comprises a probe 'comprising a 
segment exactly complementary to^ each variant sequence 
assigned a particular value in^a particular digit When 
variants are assigned successive numbers in a numbering "system 
of base m- having n digits/' h x (m-l> pooled probes' are used 
are -used to assign each variant *a designation.' 

In^a sixth embodiment, 'the invention provides a pooled 
probe for -trellis tiling, a further species "of the general 
-tiling : strategy . - In trellis tiling; the identity 'of a ' 
nucleotide in -a target sequence is determined from a'* ' * 
comparison of hybridization - intensities of- three' pooled'" 
trellis -probes . - A pooled trellis probe comprises a segment 
exactly complementary to a subsequence of a reference sequence 
except at a. f irst interrogation position occupied'^ by a pooled' 
nucleotide N, a second interrogation position occupied' by a 
pooled nucleotide selected -from the group of three consisting 
of (1) M or K, (2) R or Y and (3) S or w,^ and a' third' 
interrogation- position occupied by a "-second pooled' nucleotide 
selected from- the group-. The pooled^ nucleotide" occupying the' ' 
second interrogation position comprises a nucleotide" " - 
complementary- to a corresponding nucleotide from" the reference 
sequence- when the- second pooled probe and" reference sequence 
are maximally aligned*, - and the pooled nucleotide ' occupying the 
third .-interrogation- position comprises a nucleotide^ " " 
complementary - to a- corresponding nucleotide from * the "reference 
sequence . when^ the- third poo-led probe and - the- reference ' ^'^ * 
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sequence are maximally aligned.* '. Standard - TUPAC^ nomenclature 
is: used for describing- pooled nucleotides . ' - \ ^ 

. , In trellis tiling; ' an array . comprises - at- least first , 

second and third, cells , respectively, occupied by first, . second 

. and "third pooled probes, each < according ' to* the ■ g.eneric : 

.description above. : However, the segment of' complementarity, 
location of interrogation positions ; and selection of ^ pooled 
nucleotide at each interrogation position may or may not 
differ -^between :the three pooled probes subject to the"^- ' 
following constraint, ' One of the three interrogation- ' 
positions in each of the three pooled probes must align* with 
the same -corresponding . nucleotide in the referencev sequence . 
This interrogation position must' be . occupied : by a N vin'^one of 
the- pooled probes , and a different pooled nucleotide inv each 
of .the -cOther two pooled probes. . ;^ i ■: . / : . ^o. 

~ /-^the'- invent ion^provides arr^a 

for bridge tiling. Bridge tiling is a -species of the - general 
tiling strategies noted above, in which probes from the first 

, probe set:, contain more than one segment - of . complementarity,. 
:In bridge tiling a -nucleotide : in a ref erence sequence is 
usually determined from :a comparison -of four probes. A "first 
•probe comprises at least first and second segments, . each of at 
least three nucleotides and each exactly complementary to 
first and second subsequences of a reference sequences. ' The 

..segments including at. least one interrogation position ' 
corresponding; .to- a nucleotide , in the reference sequence: 
Either (1)- the f irst and second subsequences are noncontiguous 
in the reference sequence, or (2) the first and second" 
subsequences are contiguous and the first and second segments 
are inverted relative to the \firs^ subsequences. - 

The arrays further comprises second, third^ and fourth probes, 
which are^ identical to a sequence comprising the first probe. 

' or- a subsequence thereof comprising at least three nucleotides 

,frpm each of the first- and second segments, except 'in -the at 
least one- interrogation position., which differs^ in- each -of the 
probes . ,^, 'In - a . species -of bridge tiling , . ref erred*, to as 

. deletion: tiling., • the f irst and second' subsequences are- - 
separated- by one or two: nucleotides in the reference sequence . 
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In - an' eighth ^ embodiment ;• . the invention provides -/arrays of 
probes for multiplex tiling. Multiplex tiling is ' a strategy , . 

• in. which : the identity of two nucleotides in. a target sequence 
is ^determined from a comparison of the hybridization 

5 intensities of four probes, each having two interrogation 
positions. -Each of - the probes- comprising - a segment of at 
least 7 nucleotides that is exactly - complementary .to . a 
subsequence from - a reference sequence, except that the -.seginent 
may or may not be exactly . complementary :at - two interrogation 
10 positions; - The nucleotides occupying the interrogation 

positions are-selected by the following rules:. • -(!)' the first 
-interrogation position is • occupied by a- different nucleotide 
. • in : each of -.the . four probes > .(2) 'the second v interrogation 
-position is occupied' by a different nucleotide in each of the 
15 four probes, (3) in first . and second probes, : the segment' is ' ■ - 

• exactly, complementary to the subsequence, except at no more 
than -one of the interrogation positions, r (-4 ) in third and 

.fourth :probes, tthe- segment is exactly complementary to the 
subsequence, except at both of the interrogation positions. 
20 •.:*\:In-a ninth . embodiment , the invention provides^arrays of • 

- • • ■'•immobilized probes including helper mutations.' Helper ' 

mutations are useful for, e,g., preventing self-annealing of 
. probes 'having inverted repeats. In this strategy, the ""^ 
identity of- a nucleotide in a target sequence is usually 
25 determined' from a comparison of four probes .'" A first- probe 
comprises a segment of at .least 7 nucleotides' exactly- ' ' 
: : complementary to a- subsequence of 'a- reference* sequence except 
at -one or two positions, the' segment -including an" 
interrogation positron not at the one or two positions'. The 
30 . one or two positions are occupied by helper' mutations . " 

- - Second, third and fourth, mutant probes are each identical to a 
sequence comprising the -wildtype -probe* or a subsequence 
thereof --including the interrogation position and the one or 
rtwo positions, except in the interrogation position, which is 
35 - occupied- by a: different nucleotide in each of the four " probes*. 

^ In- a- tenth embodiment-, the invention provides arrays of 
probes comprising at least :two probe sets,- but. lacking^ a^ probe 
set comprising probes that are- .per f ectly matched; to ' a ' 
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.referencei.-sequence. Such arrays are usually. employed in 
methods in which both • reference and target ; sequence- are 
hybridized, to -the ..array. The. first .probe ^ set comprising a 
plurality of. probes, each probe comprising a segment exactly 
complementary to a subsequence of at least ..3 nucleotides of a 
reference sequence except at an interrogation position. The 
second probe set comprises a corresponding probe for. each 
..probe, in the first probe, set, the corresponding probe in the 
.second probe,- set being ..identical . to a sequence . comprising the 
corresponding probe from the, first probe set .or a -.subsequence; 
of at least three nucleotides thereof that includes. the. 
interrogation position, except that the interrogation position 
, is occupied .by a different nucleotide in each of the .two 
corresponding probes and the complement to .the reference. 

-■sequence.;;, •.. !-:,■. • • . - , ■ ■ •- •• - - ■ •*.■•■■• ■- ■•- '■ 

, In an . eleventh -embodiment , . the., invention ..proyides :i?iethods 
of comparing a target sequence with a reference sequence 
comprising a predetermined sequence. of nucleotides .using any 
of -the arrays described . above . The methods comprise.. .. 
hybridizing the target nucleic acid to an: array .and.. ,., 
determining...which probes , relative to one another . in -the 
array bind specif ically to the target nucleic acid. . The 
relative specif ic binding of the probes indicates, whether the 
target sequence is the same or. different from the reference 
sequence. In -some such methods , the. target sequence has a 
substituted nucleotide, relative to ; the .reference , sequence in 
-,at.-least one undetermined position, and the relatiye specif ic 
binding of the probes indicates the location of .the position 
and the nucleotide occupying . the position in the .target, 
^sequence,," • I a second target, nucleic-^acid is . 

also hybridized. to the array. The relative. specif ic binding 
of ..the probes ._then.. indicates 'both .whether, the target sequence 
is, .the same or different from -the ..reference . sequence , and 
whether: the second target sequence is, .the ..same- or .-different 
from the ref.erence. sequence. In some methods , . when . the:.. array . 
^comprises two groups .of probes . tiled . for . first and. second 
- ref.erence. sequences., respectively,- the relative, specif ic 
binding of. -probes in the. first group .indicates. whether„the 
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target -sequence - is --the- -saiae or different from the: first - 
reference sequence . The relative .specific binding, of probes 
in the ^second group indicates .whether the target sequence is 
the same or different -from the second reference sequence. 
5 • Such methods are particular ly useful for analyzing. 

heterologous alleles of a gene. " Some methods entail 
hybridizing both a reference ^sequence and a target sequence to 
any of the arrays of probes described above. Comparison of 
the relative 'Specific binding of the probes to the reference 
10 and target sequences indicates whether the target sequence is. *, 
the same-or different from the . reference sequence.,. . 

. -In a -twelfth . embodiment, .the invention provides .arrays of 
immobilized probes in which the .probes are designed, to tile a 
reference sequence from .a. human immunodeficiency virus...... 

15 Reference sequences from either the reverse transcriptase gene. 
- or protease 'gene -of -HIV .are. of ..particular interest. Some 
chips.'further comprise arrays of ^probes. tiling a reference 
. sequence -from a 16S RNA or DNA encoding the 16S ;RNA: from a 
pathogenic -microorganism. The invention further .provides 
20 methods of using such arrays in analyzing a HIV .targets ' 
-sequence .The methods are particularly useful where the' 
target • sequence - has a substituted nucleotide -relative to the - 
reference sequence in at least one position, the substitution^ 
" conferring resistance to a drug use in treating -a patient 
25 ' infected -with a HIV virus.- -The methods reveal .the existence '^^ 
of the- substituted nucleotide.- The methods are also 
-particularly useful for analyzing a mixture- of undetermined 
proportions of first and second target sequences from./ ^ 
different- HIV variants. -The . relative specif ic- binding of 
30 ptobes -indicates -the proportions of the first and second^ 
. target sequences . .... • 

- In -a thirteenth . embodiment , -the , invent ion provides arrays 
of probes tiled -based on reference -sequence from a CFTR. gene. 
" A ■ preferred -array - comprises -'at least a group of. probes 
35 . ■ comprising ^ a -wildtype -probe, and.-.:five- sets of three mutant 
probes . ^ The • wildtype . probe . is exactly .complementary .to a 
subsequence of .a -reference * sequence from a. cystic, /fibrosis 
gene., the segment ..haying at -.least . five interrogation .positions 
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corresponding to five contiguous nucleotides- in- the reference 
sequence. The probes in- the^first set of three inutaht probes 
are each identical to -the wildtype probe, .except in a first of 
the five interrogation positions, which is occupied by a 
different nucleotide in each of the three mutant probes and 
the wildtype probe. The probes in the second set of three 
: mutant- probes are each identical "to the wildtype probe/ except 
in a- second. of the five interrogation positions, which is 
occupied by., a different nucleotide in each of the three mutant 
probes and, the wildtype probe. The probes in the third set of 
three mutant probes are each identical to the wildtype probe, 
except in a third of the five interrogation positions, which 
is occupied- by a- different nucleotide in each of the three 
mutant probes and the wildtype probe. The probes in the 
fourth set, of three, mutant probes-. are -each identical- to- :the 
wildtype- probe y. except . irn^ a fourth of - the five" interrogation 
positions, which is occupied by a different nucleotide in each 
of the three. mutant probes and the wildtype probe. The probes 
.in. the fifth set of three, mutant probes are each, identical to 
the wildtype probe, except in a fifth of the f ive— ^ . > 

interrogation positions, -which is occupied by^ a different 
nucleotide in each of - the three mutant probes and . the wildtype 
probe. Preferably, a chip comprises two such- groups of 
probes.. The .first group comprises a wildtype probe exactly 
complementary to a first reference. sequence, ■ and, the^ second 
group comprises a wildtype probe exactly .- complementary: to a 
second reference sequence that^ is a mutated form of:- the first 
reference . sequence..-. ■ . . ^ . . . . 

The invention further provides methods of using the- 
arrays 'df the invention for analyzing ^ sequeni:e3- f rom a' - 

CFTR .gene. ..The methods are ■ capable of simultaneously 
analyzing first and second^ target sequences , representing- 
heterozygous^ alleles -of a CFTR gene. 

-In a -fourteenth embodiment, the - invention , provides arrays 

of probes tiling a reference .sequence- from- a- p53 gene, . an 
hMLHl. gene and/or an ;MSH2 gene. - The invention • further 
provides, methods- of- using- the -arrays described above -to-^ - 
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" ' -analyze* these genes. • The method-are usef ul , ' *e . g / f of"- 
■ 'diagnosing patients susceptible to developing cancer, 

■ In a" fifteenth embodiinent / the invention provides arrays 
of probes tiling a -reference sequence from a mitochondrial 

5 ' genome; The ' reference 'sequence may- comprise part or all of 
the p-loop region, or all, or- substantially all,- of the' - 

* mitochondrial genome". The invention further provides- method 
of using the arrays -described above to analyze target 
sequences from a mitochondrial genbine. • The methodis are useful 

10 for -identifying mutations associated with " disease , arid for 
forensic, epidemiological and evolutionary studies. — 

■ - ^ • BRIEF DESCRIPTION OF THE - FIGURES ■ ^ . ' • 

■ .r:: Fig* T:" Basic tiling strategy The ' figure illustrates 
15 •■ the relationship" between an interrogation' position '(I)" and a ,^• 
'^ . corresponding nucleotide . (n) in the reference sequence, * and 

between a" probe from the" first probe set and corresponding 
: probes from second, third * and fourth' probe set^.' ■ 

• Fig.' 2^: Segment of complementarity in a 'probe from" the 
20 first probe setT, - ■ * - - . . i .v.- . , ,^ ■ 

Figi 3 ' Incremental * succession - of probes in a basic . 
tilinig strategy. " The f igure shows ' four probe sets, 'each 
having three ' probes . Note . that each probe differs from Its *• 
predecessor in the same - set by • the acquisition of a '5'. 
25 ' • nucleotide" and' the loss of a 3' nucleotide," as well ' as 'in the * 
nucleotide occupying the interrogation position ■ 
' *• '^Fig-^'A: ' Exiemplary arrangement of * lanes on a ' chip ^ The 
chip shows four probe sets, each having five probes'' and each 
having a -total of five interrogation- positions' (11-15) , one 
30 per probe. - . ' • " " • ' . . : 

"FigJ'S: Hybridization "pattern ' of chip - having ' probes laid 
down'in ^ lanes. Dark patches - indicate hybridization.^ ' The 
probes in the lower part- of the figure- occur- at the^ column of 
the array, indicated -by the • arrow 'when the probes -length is 15 
35 and the interrogation 'position 7 ; " - ••• - . .. v: 

Fig:- 6 r.^ ^Strategies f or - deteeting del'etTon "krid- insertion 
mutations.- Bases -in brackets • may "of *rnay not be- present'.' 
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r- r: Fig. 7 : ; Block' tilings strategy. " -The : probe : from, the first 
probe set has three interrogation positions; - The- probes from 
the other-probe sets have only * one \of- these* interrogation 
positions. " 

Fig. 8:.: Multiplex .tiling strategy . Each probe, has two;, 
interrogation positions; ; 

> . \c Fig. 9 Helper mutation strategy ... The segmentcof 
complementarity differs from .the complemerit of the . reference 
sequence -at . a helper :mutation. :as /well as the interrogation 
position;: ' . : . : - ; : . . . 

:. / ;Fig- 10 .Layout of probes on -the HV 4 07 -chip. : The : figure 
shows 'successive rows of. sequence each of which . is subdivided 
into four lanes -The four lanes correspond -to the A-;;C-, G" 
-and T-lanes. on the chip. -Each probe is ^represented by -the 
nucleotide occupying ;its - interrogation- position, r The' letter • 

. "N" indicates a control p2-obe^or'*empty^..Golumn,^ The different 
sized-probeS' are laid out in parallel. That is , from . top-to- 
bottom, a row of 13 mers is followed- by a row of 15. mers,' 
which 'is followed by a row of 17 mers, Xwh'ich is followed by a 
:row--:of ^-1.9 mers • -r./: \ s --^^^^ 

:J r' ' Tx^:' 11^' Fluorescence pattern of HV ':407 hybridized to a 
target sequence (pPoll9) identical to ^the chips .reference 
sequence. - - . ■ . ■ : ; . - 

, Fig. 12 Sequence read from HV 407 chip hybridized to 
pPoll9 and 4MUT18 (separate experiments) The reference 
sequence as designated "wildtype Beneath thevxef erence 
sequence -are four rows of sequence read, from the chip / ./ 

' hybridized to ^thepPoll9 target, the , first row :being read from 

13 mers,. the second row • -from 15 .mers, the third row from 17 
Miners and - the ; f burth: ; row^, f;rbm::;19i^^^^^ ; T .Beneath thes.e ' v. ; V 
sequences , .there "are . four further rows of sequence .read from 
the .chip- hybridized to the 'HXB2 .target. Successive rows are 
read. ;from rl3 mers, .15 \mers, :17 mers and 19 mers. \ Each 
nucleotide :in a :row is called- from' the -relative fluorescence 
intensities ot probes in . A- , G- and T-lanes,. : Regions of 

ambiguous .sequence read . from the chip are -highlighted.'. .The 
strain differences between the HBX2 sequence and rthe: reference 
sequence that were correctly detected are indicated (*) , and 
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. those- that- could hot ■ be called- are indicated • (o) ; ^ - (The 
nucleotide at position 417 was read correctly in some 
experiments) . 'The location. of some mutations known to be 
associated with drug resistance that occur in readable .regions 
5 of the. chip are shown. above (codon number) and below (mutant 
nucleotide) the sequence designated "wildtype." The locations 
of primer used to amplify the target .sequence are indicated by 
arrows.* : " .-^ ■ . ^ - . \ - . ■ ' ^ ■ :: : • 

"Fig; 13: Detection- of - mixed, target sequences. -.The 
10 mutant target differs from the wildtype by a single. mutation : 
: in codon -67 of the reverse transcriptase • gene ; .Each different 
sized group .of probes has ' a column of four probes* for 'reading 
"the -nucleotide in which the mutation occurs ; The f our '"probes 

* occupying 'a column are represented by a -single - probe in the 
15 ' figure with the - symbol ■* (o)' indicating the- interrogation ' 

• ' " '^position, which - is occupied by a different nucleotide in each 

jprobe.; - : ' -* ' ■ " . : * ■ . . 

Fig.*. 14: -: Fluorescence .intensities of target -bound to 13-- 

• . mers :and 15 mers for different proportions of / mutant.' and ^ 
20 wildtype target. The fluorescence intensities are from probeis 

.-./having . interrogation positions- for- reading the nucleotide at 
which -the mutant ' and wildtype targets ^-diverge. 

Fig. 15: Sequence read from protease chip from four 
- clinical- samples- before and after treatment with'ddl>. 
25 - V Fig. 16-: Block tiling array of probes for- analyzing a — ' 

CFTR point mutation . Each probe show -actually represents four 
probes/ with :'.one probe having each- of- A, C,- G or 'T- at '-the 
interrogation position. N. In the - order shown , - the -first probe 
shown on the-left is -tiled from the " wildtype • reference - 
3 0 sequence, the second probe from "the mutant . sequence, and- so on 
'in alternating fashion. Note. that all of 'the probes, are 
- ; identical except at the interrogation posit ion which shifts 
one -.position -between successive probes tiled from the "same 
..- reference sequence (e.g.,. the first, third and • f if th probes in 
35 - .the left hand column.) The grid shows • the hybridization 
:intensities when \the .array is ^-hybridized to the ^reference 
; /^ sequence;. -c -v. . r": '.: 7.,--:;-..:':.,:: -..::^..7> 
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Fig.- 17 V. Hybridization pattern ^ for. heterozygous target. 
The -figure shows the hybridization pattern when the .array .of 
the previous figure* is hybridized to a mixture. of mutant and 
wi^ldtype .reference sequences. ; . ■ — v , 

Fig.; 18,: in -panels A, B, and C, -shows an image made, from: 
: the region of .a DNA chip containing CFTR exon 10 probes; ^. in 
panel A, the chip was hybridized to a, wild-type target; in 
•panel Cr the chip -was hybridized to . a mutant AF5 08 .-garget ; and 
in- panel B, .the chip was hybridized to a mixture .of the 
wild-type . and mutant targets. . - 

^.Fig. 19 , in sheets 1 3 corresponding to panels A, B, 
and C of • Fig. 18,- shows graphs of- fluorescence intensity 
versus tiling position. . The labels on the horizontal axis 
show the- bases in the. wild-type sequence corresponding to the 
position, of substitution .in -the respective probes, . Plotted 
are the vintensities- .^ .the features Cor.. .sy:nthes is 

sites) containing wild-type probes, the features containing 
the substitution, probes, that bound the .most target ■{ "called" ) , 
and the, teature containing the substitution probes , that . bound 
the, target with the second .highest, intensity „of all .the^.. 
substitution, probes ("2nd Highest") . . 

Fig. ,20,. in panels A, B, and C, shows an image made from 
a... region of a DNA chip containing CFTR exon 10 probes ;, in 
panel. A,, the chip was hybridized to the .wt480 target; in. panel 
C, the chip was hybridized to the mu48Q target; .and in panel 
B,. the chip, was .hybridized to a mixture of; the wild-type and 
mutant targets.. - 

^ Fig. , 21, . in. sheets 1 - 3., corresponding to panels A, B, 
and C of. Fig,.. 20, shows graphs of fluorescence, intensity,, 
versus tiling, position. The labels on the horizontal axis 
show the bases in the wild-type sequence corresponding .to the 
position, .of substitutipn in the respective .probes., . Plotted 
are, the intensities observed,,.f rom . the features, (.or .synthesis 
sites) containing, wild-type probes, the. features ^.containing 
the .substitution -probes .that bound^the, most .target . ("called" ) ; 
and^the feature containing the substitution ..probes, that ^bound 
the., target, with, the , second ..highest ...intensity .„of .all -the.-.- , 
substitution probes .("2nd.. HighestV) ^ - ^ ^ 
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'Fig * ■ 22 , - in panels^A and'-B/*"shows an '"'iinage made" from a 
region*of a DNA chip containing CFTR exon-10 probes; in-panel 
A; the' chip was^ hybridized to nucleic acid -derived from the 
genomic DNA of an individual with ' wild-type AF508 sequences; 
5 -inpanel.B, the target nucleic acid originated • from a 

heterozygous (with respect to the AF508 mutation) -individual. 

Fig. 23 , - in ' sheets - 1 and 2 , corresponding to panels A and 
B of Fig. 22, shows graphs of fluorescence intensity versus 
tiling position.' The labels oh the horizontal axis show -the 
10 bases in the wild-type sequence -corresponding to the- "position 
of substitution in the respective probes. Plotted' are the 
intensities; observed from* the features- (or: synthesis Z^sites) 
containing "Wild-type probes , the features- containing "the- 
substitution -probes that -bound the most target' ( "called" )> and 
15 V the ■ feature containing the substitution probes that bound- the - 
target with -the second highest intensity -of air the' - 
- substitution probes ("2nd Highest") . ' 

" Fig. 24': Hybridrzation of * homozygous . wiTdtype (A) : and 
heterozygous (B)* target sequences from exon n' of- the' CFTR 
20 gene^to^a block 'tiling array designed to * detect G5 5 ID' and" 
Q552X mutations in CFTR gene. ' ■ /.r . , , 

'Fig. 25: Hybridization of homozygous wildtype (A) and 
AF508 "mutant (B) target sequences from exon" 10 of the CFTR 
gene to 'a" block tiling array designed to detect mutations, ^'^^ 
25 ■ AF508, -AI507 and F508C.' ■ ■ — - - ^^ : ^ . 

V - .'^ "Fig 26 : Hybridization of heterozygous mutant target 
sequences , AF508/F508CV to the array of Fig, 25 . • "'' -^ 

' Fig:- 27 shows the alignment of some' of the probes on a 
p53DNA"chip with " a 12-mer model target nucleic acid. 
30 - ^ Fig. 2 8' shows a- set of 10-mer probes for' a p53 'exon~ 6 DNA' 
'' chip — ^ ■ ' • - . : ' . ■ ' \ ... n 

^ -Fig.- 29 shows,' that ■ very distrnct patterns are- observed 

"after- hybridization pf*''p53 DNA chips; -with' targets- havings- 

* -'dif-ferent "a* 'base -substitutions • In' the^ fdrst image in^Tig. 
35^ 29,- the 12-mer probes that form perfect matches with the' 
wild-type "target are in' the first row -(top) . ''The- 12-mer ' 
probes with single ^base mismatches -aire located in- the- second, 
third, and fourth rows ^^and have much lower signais * 
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' Fig- ;3 0, in graphs .2 , 3., .a,nd-4 graphically^ depicts the 

data in Fig_. 29 ^ On each graph, the X ordinate is the , 

^rposition of' the probe ^in its. row. on . the chip, ..and the Y 

- -ordinate iS; the'-signal at . that probe site after hybridization. 
5 • — Fig- -31 .shows the results ::of hybridizing mixed, target 

populations of; WT -and .mutant p5 3 genes to the p5 3: DNA ■ chip • 
Fig-; 32 in graphs 1-4 , . shows (see Fig . 30 as . well) the 
hybridization efficiency of a 10-mer probe array ; as compared 
to_a -1-2 -mer' probe array - v : > . rr ■ 
10 ^ - \Fig. 33^ shows, an image of a p53 DNA chip hybridized to a 

target' DNA, % ^ - - - : : r ' . . • * ■ ' - ■ 

. ^ . Fig-. , 34 ii-lustrates how :-the -actual • sequence was read f rom 

- ,the chip ;Shown; in. Fig-, 3 3-- ; Gaps in the sequence ;of letters in 
•/1:he.,WT rows ;correspond rto controT probes^., or - sites . , Positions 

15 . at which bases, are miscalled are represented by letters in j ;. 
italic- 4::ype> in- cel^S;- corires 

bases have been substituted by other bases. 

Fig.; 35 shows the human mitochondrial genome;. ."Opj" is the 
H/strand.. origin of replication, and arrows indicate the cloned 
20 unshaded - sequence. - • - - - : > 

//-.v.. Fig.. .36 shows ^the -image observed from -application :.of a 
sample- of- mitochondrial- -DNA derived nucleic acid (from the mt4 
sample) on a DNA chip. 

. Fig. 37 is similar to Fig. 36 but shows , the. image . . 

25 observed from the -mtS sample. • - , / - 

Fig. . 38 shows, the . predicted" difference image -between the 
. mt4.- and- <mt5 samples^ on the DNA chip >asedvon: mismatches 
. . between^ the two - samples- and the reference sequence. • - 

Fig. 39 shows the actual difference image observed, for 
30 / / the^ mt4 and , mtS Vs^ y . . T ' \ J .J 

Fig. 40, in sheets* 1 and 2, shows a plot- of-- normalized 
intensities across . rows -10 and 11 o.f the; array and. a 
tabulation of the mutations detected . : - .. * ; - : .r ' 

Fig. - 41 shows the -discrimination between wild-type and 
35 mutant hybrids -obtained: with, the- chip.* A., median^: of '^^the six 

n(prmalized hybridization scores-, for ' each, probe':; was "taken; the 
graph plots the ratio of the median score to the normalized 
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hybridization'- score versus mean -counts "A ratio - of 1.6 and 
mean ' counts- aboVe 50 'yield no false positives. 

■ Fig. 42- illustrates- how the identity of the base" Tni'smatch 
may influence the ability to discriminate mutant - and wild-type 
5 sequences more than' the "pos ition -of the mismatch within an 

oligonucleotide probe . The mismatch position is expressed - as 
% of ^ probe -length from ' the 3 * -end. The base change is 
indicated on the graph. - - ~ ' 

Fig. 43 provides a 5' to 3 ' sequence listing of 'one' 
10 ^ target - corresponding to the probes on the' chip . - X is a 

control probe. Positions that differ in the target '(i.e. ^ are 
mismatched -with: the probe ' at the designated site) ^ are in bold. 

Fig ..44 -shows the fluorescence image- produced by scanning 
the "chip.^ described in -Fig. 17 when hybridized to'a sample. 
15 X. :.;Fig. 45 illustrates the detection of ' 4- transitions' in the 

target - sequence relative to the wild-type probes' on- the* chip 
in Fig. 44 . * - - ' . • - - " ^ • • ■■ - - ^ 

' Fig-' ^e: VLSIPS™ technology ^ applied- to the^ light 

directed^^"synthesis of oligonucleotides . - ' Light- (hv) • is ■ shone 
20 through a mask (M^^) to activate functional ^groups (-OHy^'on a 

surf ace- by. -removal* of a protecting group (X) . Nucleoside > 
building blocks protected with photoremovable protecting- 
groups (T-X, C-X) are coupled to the activated areas. By 
repeating the -irradiation and coupling steps , very complex 
25 arrays of oligonucleotides can be prepared. ■ - - " 

- - *Fig.': 47: • Use' -of ' the * VLSIPS™ process to prepare 
"nucleoside combinator ials" - or oligonucleotides synthesized by 
coupling-:alL four nucleosides to ^ f orm' dimers , trimers, and so 

forth'.. • ■ - r r. : . . : 

30 Fig. 48: Deprotect ion , coupling , and -oxidation-steps of-* 

' a solid ■ phase- DNA synthesis method. - ' - < - ' • 

-Fig." 49":^ -An'- illustrative synthesis route^^ f or ■ the - - 
nucleoside building blocks used in' the VLSIPS"™ method; 

■ .Fig".- 50: - - A' preferred photoremovable -protecting group, 
35 • MeNPOC,- and . preparation" of the group in - active -f orm^ ' ■ • 
: :Fig.-- 51 :- ^ -Detection: system^' for . scanning a DNA 'chip.- 
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V -DETAILED DESCRIPTION;: OF/ THE INVENTION-'- 
. ^ ^ The ■invention, provides a number of-- strategies, f or . 
comparing polynucleotide of known sequence (a reference 
sequence) with variants of that sequence (target sequences). 
The comparison can be perf ormed . at: the :level' of entire - 
genomes, chromosomes , -gfenes , exons or; introns, or can focus on 
individual mutant sites and -immediately adjacent bases. The 
strategies-allow. detection of variations , such as. mutations or 
polymorphisms, : in the target; sequence irrespective whether a 
particular variant has previously been characterized. . The 
strategies both define the nature of^ a variant- and .-identify 
its location : in' a. target sequence. — - 
■ The 'Strategies employ , arrays- of oligonucleotide probes 
immobilized to . a solid support . ; Target sequences are -analyzed 
by determining „ the -extent of. hybridization at particular 
r probes^^in^the array . - The: strategy^^in; , selection... of spra^ 

facilitates -distinction^between perfectly matched- probes and 
probes showing single-base or other degrees of mismatches. 
The: strategy , usually entails, sampling each- nucleotide of 
: interest "in -a target sequence several times , thereby achieving 
a '.high degree of confidence in - its. identity.- This; level- of 
confidence is further increased by sampling of adjacent 
nucleotides in; the target sequence to nucleotides of interest. 
The. number of probes on the -chip can be quite large (e.g., 
10^-10^) ;.' * However , usually only a small proportion of the . 
total number of probes of a; given length- are. represented. 
-Some advantage .of , the use of only ; a small - proportion of. all 
.possible probes of a given length include: (i) each- position 
in , the array is highly informative, whether or- not 
.hybridizatipn oc^ nonspecific hybridization is 

^minimized;--(iii) it is; straightforward to correlate- 
hybridization -differences- with sequence differences 
particularly with reference- to the' hybridization pattern- of a 
known standard; and. (iv) .the; abilityr to address each probe 
independently during synthesis using, high resolution , , 
photolithography , . allows the.- array to -be designed.- and. ; . ... 
optimized , f or, :any , sequence. . For. example the length of any 
probe - can be varied, independently of /the others. ■ v., , : 
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The present tiling strategies result in sequencing and 
comparison methods suitable for routine large-scale practice 
with a high degree of confidence in the sequence, output , ' 

I. ' GENERAL TILING STRATEGIES ■ . • - 

■ A'. Selection of- Reference Sequence • • . • . . . - / 
■-The chips are designed to contain probes exhibiting 
complementarity to one or more selected reference sequence 
whose sequence is known.- -The chips are used to - read - a' target 
sequence* comprising either the reference • sequence ' itself -or 
variants of that sequence. Target sequences may ^differ - from 
the reference sequence at one or more 'positions • but show a 
high overall degree of sequence identity. with the reference 
sequence (e .g.,- at' least 75,-90, 95-, 99,- 99.9 or 99.99%). Any 
polynucleotide of known- sequence can be -selected as ai * 
reference sequence. Ref erence sequences of interest - include 
sequences known to include mutations or ' polymorphisms 
associated With- phenotypic changes having- clinical 
significance in human patients. For example, the CFTR gene 
and \P53 gene in-humans' have been identified as'the^-lbcatiori of 

- several' mutations resulting in • cystic " f ibrosis or cancer 
respectively. Other reference sequences of interest include 
those that serve to identify' pathogenic microorganisms and/or 
are- the site of mutations by which such microorganisms acquire 
drug resistance - (e ig. > the HIV reverse transcriptase gene) . 
Other- reference' sequences - of ^ interest • include -regions- where 
polymorphic variations are known to occur • (e .g. , -the D-ioop 

^region of mitochondrial DNA) . - These ref erence sequences have 
utility for> e'/g--, forensic or epidemiological studies . Other 
reference sequences • of interest include p34 (related - to p53 ) , -* 
p65 (implicated in breast prostate and liver - cancer )> -and DNA 
segments ' encoding cytochromes' P450 • (see Meyer et • al.-/ Pharmac \ 
Ther. 46, 349-355 ( 1990)') . Other ■ ref erence -sequences * of 

"interest include those from ^ the ' genome of pathogenic -viruses 
(e.g. ; hepatitis (A, B," or C) v * herpeS' virus (e Vg; > - VZV, • HSV-l ; 
HAV-6, HSV-II'> and' CMV, : Epstein- Barr- virus) , adenovirus;''. 

• inf luenza- virus", f laviviruses , (schovirus / rhihbvirus - 
coxsackie ■ virus-; cornovirus , • respiratory - syncytial ~ virus ,. 
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viru.. rotavirus „ .easles^ ^i.us . ..rubel la v^r^s , _ 
parvovirus, vaccinia virus,.. HTLV virus , . dengue vxrus , 
plpillcavirus:, .olluscu... virus,, poliov.ru. , . rabxes ;V.rus , JC 
. virus, and arboviral encephalitis yirus ... Other reference 
sequences of- interest;, are ,froi. .genpmes or epison.es.of ^ 
. pathogenic bacteria , p.^rt icular ly, regions that conf er, drug 
resistance -or-allow phylogenic characterization of the host 
(e g 16S .rRNA or corresponding DNA) . /.For example,, .such 
bacteria., include chlamydia, rickettsial bacterx^ ,.. 
:„ycobacteria, staphylococci, treptocci, .poeumonocoqci,,.._ 

meningococci and conococci, Klebsiella, proteus,^ .serratxa, 
pseudomonas, Legionella, diphtheria, salmonella, .bac^llx, 
cholera,.:tetahus, botulism, anthrax,.. plague, leptosp.ros.s , 
' and Lyines. disease, bacteria. Other reference sequences _of 

interest include those . in which mutations: result ,xn the . . 

" ^f o^lowing lautosomaUxecgssive d^ ■ sickle cell .anemia, 

^-thalassemia, phenylketonuria, galactosemia, .Wilson .'.s ,^ 
disease, hemochromatosis, .. severe combined, immunodeficiency, 
alpha-1-antitrypsin deficiency,^ albinism,, alkaptonuria, 

lysosomal, storage diseases and Ehlers-Danlos syndrome , Other , 
reference sequences of interest include those in which 
mutations result in X-linked recessive., disorders: hemophilia, 
glucose-6-phosphate dehydrogenase, agammaglobulimenia , - 
diabetes insipidus ,. Lesch-Nyhan syndrome, muscular dystrophy, 
. Wiskott-Aldrich, syndrome, Fabry.- s disease^ and f ragile- X- 

syndrome Other reference, sequences, of .interest , includes 

those, in which mutations result., in the f ollowing, au,to,son>al 
dominant disorders :. I familial hypercholesterolemia ,. p.olycystic 
. kidney., disease, Huntingdon • s, .disease , hereditary 

3 0' ■ : spherocytosis , Mar fan - s syndrome , ..von. Wi 1 lebrand's, disease ^ ^ 

neurofibromatosis, tuberous, sclerosis;, hereditary hemorrhagic ' 
telangiectasia,, familial colonic polyposis. . .Ehlers-Danlos ■ 
syndrome , . myotonic dystrophy,, muscular , dystrophy , osteogenesis 
imperfecta.,., acute intermittent porphyria, and von Hippel- 

35 Lindau-.disease... , : •• ■ . 

The. .length of..,a, re.f erence .sequence .can. .vary. .widely, from a 
full-length genome," to .an .individual., chrom.ospme ,^,^.,epis 
.gene, component , of .a gene , ^such ^as .an ,exon ,,. ^.Ir^tron ^or;^.^ 
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regulatory sequences/ to a 'few nucleotides'. A -reference' 
sequence of between about 2/ 5, 10, 20, 50, 100, 5000, 1000, 
S/OOO or 10,000*, 20,000 or 100,000- nucleotides- is -common. 
Sometimes only particular regions of a sequence (eig. , -exons 
5 of a gene) are of interest: ' In such' situations, -the 

' particular regions can be considered as separate reference 
sequences or can be considered as components of a single 
reference sequence, as matter of arbitrary choice. ■ 

A reference sequence "can be any naturally occurring^ 

10 mutant /'consensus or purely hypothetical- sequence of -' 

nucleotides, RNA' or DNA." For example, sequences ^can^ be • 
obtained from computer ' data bases publications ' or -can ^be 
determined "or conceived de^ novo:- Usually,- a- reference 
sequence is selected to show a high degree of sequence- 

15 identity to envisaged target sequences; Often, particularly, - 

where a significant 'degree of divergence is anticipated - 

between "target sequences, more than one reference sequence is 
selected. ."Combinations of wiidtype "and mutant reference 
sequences are employed in several applications of the tiling 

20 strategy. ' " " ' ' ' ' " v.., i , ^ 2.-:... 

H'.' ' 

B. Chib Design - - - , ■ .: . 

1. Basic Tiling Strategy ' - 

The basic tiling strategy provides an array -of 
25 immobilized probes for analysis ' of target sequences - showing a- 
high degree "of sequence identity to one or -more ' selected 
'reference sequences. The strategy is first • illustrated '-f.or an 
array that is subdivided- "into fourprobe sets-,- although it 
will be apparent that "in some 'situations satisfactory results 
30 are^ obtained from only two' probe sets, A first probe set 
^ comprises a plurality of probes exhibiting perfect - - ■ 
'complementarity with a selected reference' sequence The 
perfect complementarity" usually exists throughout - the - 'I'ength 
"of the probe.' However , probes ^having a "segment or- segments of 
35 perfect complementarity that is/are flanked by- leading or 

trailing' sequences 'lacking "complementarity to ' the ■ reference 
"sequence can also be used Within* a 'seginent of ■ ' - 
complementarity, each probe' in 'the first probe-set has- -at 
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' ' .least one • interrogation- pos it ion r-t-hat vcorresponds - to a • . 
; nucleotide /in the reference sequence .• . That -;.is ; the r :[ : 
interrogation position is aligned -with, the- corresponding, 

nucleotide in the reference sequence, when the probe and 
5 reference sequence . are aligned to maximize complementarity 

between the -two. If a. . probe has .more than one interrogation 
position:, each corresponds with a respective nucleotide in the 
reference sequence.. The -identity of an interrogation. position 
. -and- corresponding nucleotide .in a particular probe in the 
10 first probe set cannot be determined simply. by inspection of: :. 
the probe, in the first set. As will become .apparent,"an 
interrogation position and corresponding nucleotide is defined 
. by. the comparative structures of probes, in the first probe set 
and .cp.rresponding. -probes from additional, probe sets.' - T i 
15 , -...,.^ -in-princdple.-, a- probe ;could have an. interrogation " 

position-at each -position. .in. the. .segTOent.,.c;ompl.e^^ the 
reference sequence. ■ Sometimes, interrogation-positions u 
. provide more accurate data when located away from the ends of 
a -segment of. complementarity. Thus, typically a probe having 
20 a. segment -,of --complementarity-, of length- x does not. contain more 
.than -x-2 -interrogation positions.- Since probes- :are-. typically 
. ■ 9t21 -nucleotides, and usually all of a probe, is complementary, 
a probe typically has • 1-19 interrogation- positions. Often the 
probes contain a single interrogation position, at or near the 
2 5 . center ;Of probe.-. ■ • . - ■ 

...For each probe in the first set, there are, for .purposes 
of^.the present illustration, three corresponding probes- from 
three- additional -probe sets. See Fig . :1 -. . Thus , . .there are 
four probes corresponding to each nucleotide of interest in 
30 " the .reference ' sequence; Each of the four -corresponding probes 
has an interrogation position aligned with- that nucleotide of 
interest. •- .Usually, the- probes, from the three additional 
probe .sets are - identical to ■ the., corresponding, probe . from- the 
first probe set with one' exception .« -:The exception-is that at 
.35 . least -one ,r( and -often only one.) interrogation- position, which 
occurs -in- the same position -in. each of the-four. corresponding 
probes from- the -four probe sets,. .is- occupied by, >■ 'different 
nucleotide in -the four -probe sets-. -For- example-,- rf or an.' A 
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nucleotide "in the "ref erence 'sequence/- the corresponding probe 
from the first probe set has its interrogation position-' 
occupied by a T, and the corresponding probes front the 
additional three probe sets have their respective ■ 
5 interrogation positions occupied -by A-, C,' or G, a different 
nucleotide in each probe; Of- course,-, if a 'probe- from the 
first probe' set comprises trailing or flanking sequences* 
lacking complementarity to' the reference sequences '(see ' 
Fig: 2) /- these sequences need not be- present in corresponding 
10 probes 'f rom" the three additional sets." ' Likewise -'corresponding- 
probes from the three' additional' sets can contain- leading or 
trailing sequences outside the seginent"^ of^' complementarity that 
are not present in the :corresponding probe from* the * first 
probe set. ' Occasionally, the probes from' the- additional three 
15 probe fset' are- rdenticaT --(with the exception - of interrogation 
position Cs) )- to a 'contiguous -subsequence of the-' f ulT '-^ 
comp'Lementary seginent of - the corresponding probe from* the 
■ . f irst* probe- set.- -In" this case, the subsequence includes- the 
. interrogation position "and usually differs from the "f ull- 
20: /length' probe only * in' the omission' of dne'^^ or' both- terminal 

nucleotide's" from the ' termini ; of a" segrheht" of complementarity . 
That is,> if' a' probe' from- the first probe set has a- segment of 
complementarity- of length n, corresponding probes from" the ' 
other • sets will' usually -include a subsequence of the " segment 
25 of at least length n-2. Thus, the subsequence 'is^-'usually at ' 
least'. 3, 4,- 7 r~ -15 > ' 2 1 or- 25 nucleotides long, -most 
typically;, ■ in the range ' of ' 9-21- nucleotides • - The - subsequence 
should be sufficiently long to allow'a'* probe to hybridize 
. detectably more* strongly to a variant of " the- reference ' * 
30 . sequence 'mutated' at 'the' interrogation position "than' to the - • 
reference sequence. • . ' v ■ 

: The probes - can' be oligod'ebxyr iboniicleotides ^of - 

. oligor ibonucleotides , ' -or any- modified- forms - of these polymers 
. that: are. capable of - hybridizing with a target nucleic 'sequence 
35 ■ . by complementary base-pairing*.- Complementary 'base -pairing 
• means- sequence-specific- base -pairing Which includes "^e;g;- 
: VJatson-Crick base' pairing as'''well" as other ^ 'foriris^of •'-base' - 
pairing/ such-'as' Hoogsteeh- base pairing.' "Modi-f ied' -*f-brms 
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' include" -2 • -O-methyl oligorrbdnucleot ides. ' and so-called : PNAs , 
in which oligodeoxyribonucleotides are linked via peptide 
bonds - rather than, phophodiester -bonds The probes can be 
• attached by any linkage' to a support (e.g. , 3 ' , 5* /or .via the 
-base) 3 ' -attachment is more usual as this orientation ds 
compatible 'with the preferred chemistry for solid phase- 
synthesis of oligonucleot ides . ^ ....... 

- The number of probes . in the f irst probe set ( and as a 
consequence- the' number of probes in additional probe rsets) 
depends on the length of the reference sequence/ the number of 
nucleotides of interest in the reference sequence and- the 
number of interrogation' position's per probe. ' In general, each 
nucleotide of interest -in the reference sequence- requires the 
- same- interrogation position in the four sets of probes.-^' 
Consider, ' as an example, a reference sequence of 100 --"^ 
nucleotides, 50 of ^hich are. of. interes;t:, vand.: probes each 
having a single interrogation position . In this situation, 
the first' probe set requires fifty probes, each having one 
interrogation position corresponding to a nucleotide of • 
-interest in the reference sequence.: The second ,::third rand 
^ - fourth probe sets each have a corresponding, probe : for ^ each 
probe in the first probe set, and so each also contains -a 
total of fifty probes. The identity-.of each nucleotide of 
interest in the reference sequence, is determined by comparing 
. the relative hybridization signals at four probes having 
interrogation positions- corresponding to that nucleotide from 
the four, probe- sets.- * ' . ' ' - ■ \ ' 

-In ' some reference sequences , every nucleotide ' is , of 
' interest.' In other reference sequences, only certain portions 
/iri: vhichv variant sr^e;.^ mutations or po^l^ - ' 

concentrated are of -interest. In other reference sequences, 
. only particular- mutations ,or polymorphisms and immediately 
adjacent ■nucleotides- are of interest. Usually^ the first 
probe set has interrogation positions selected to correspond 
to at least -^a nucleotide-- (e'.^g. , 'representing a point- mutation) 
and' one: -immediately 'adjacent nucleotide: ' Usually-, the'probes 
in the f irst -'set-have -interrogatidn positions corresponding to 
at'least -3*,' 10 > -50, 100, lOOO:, or 20;000 'contiguous:.^;--: .: 
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1 nucleotides. The .probes usually have interrogation positions 
corresponding/to at least 5,- 10, 30, 50, 75-, 90, 99 or 
sometimes 100%. of the nucleotides in a reference sequence. 
^Frequently / the probes in "the f irst, probe set completely- span 
5 ' the reference sequence -and overlap, with one another, relative / 
to'the reference sequence. For example, in one common • . 
arrangement each probe in the f irst-probe set differs from 

■ another probe in that set by .the omission -of. a- 3 base 
complementary .to the reference sequence and .-the acquisition of 

10 a ' 5 ' .base complementary to the reference sequence.- ..- See:. 

Fig. 3. ' . • ^ ' - / - . ' : : - ^ r : ^ . : ' ; . 

For conceptual simplicity,: -the probes in- a set- are . 
usually: arranged in . order of the sequence in a lane across the 
chip.. A- lane .contains a series of overlapping- probes,- which 
15 represent or tile across./ the selected reference -sequence (see 
Fig. :-3) .• ..The components of the four sets of probes, are* 
/.usually laid down in four parallel- lanes / collectively * - 
-constituting a row in the horizontal direction and a. series of 
4 -member cco.lumns in the vertical .direction . -.Corresponding - 
20 probes from the four probe sets (i , e .•>• complementary-^ to the 

same subsequence of the- reference sequence) occupy a'^column, . 
Each probe- in a lane usually differs from its; predecessor in S^- 
. the lane -by the omission of a base at one end and the ¥ 
.inclusion of additional base at the other end as shown in 
25 Fig. .-3. -'- However, this order ly progression of probes can. be 

interrupted by the* inclusion of control probes or omission of 
probes in certain • columns of the array.* Such columns serve as 
controls, to .orient the chip/ or. gauge the background, which 
■■can\include target sequence nonspecif ically bound to, the* chip. 
3 0 The probes sets, are usually laid down- in .lanes; such that\ 

:all probes:' having an -interrogation position- occupied .-by- an A 
form an- A-lane, all probes having^ an interrogation- position 
-occupied by a C .form a .C-iane,- all probes having -an- . / 

■ 'interrogation position occupied by- :a G- form -a-,G-lane, - and all 
35. . . probes; having an ■interrogation position occupied, by a T..-(or U) 

' • rf orm.-.'a T- ..lane * (or . a* :lane) ;. -Note that in this r arrangement 
r.there ds: not . -a .unique correspondence* between probe sets ; and 
lanes .'. \Thus-, :jthe .probe -from .the first probe set is ; laid, down 
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-in .the. A-^ane, rC-lane, - A- lane A-lane and T-lane f or .^the five 
columns in Fig. 4. The interrogation position on^a. column of 
probes corresponds to the position, in the. target , sequence 
whose identity is .determined .from analysis of hybridization to 
!the probes in that column. Thus, I . respectively 
correspond to N1-N5 in. Fig . 4 . .The interrpgation position can 
be anywhere , in a probe, but is usually at or near the -central 
position of the probe to maximize, differential hybridization 
signals between, a perfect match, and a single.-rbase. mismatch . . 
For., example, for- an il mer probe, the central, position .is the 

sixth nucleotide.: 

, : , . . Although the array of .probes is usually ^ laid .down ±n rows 
and, columns as described above/ such a^ physical arrangement of 
probes.. on the chip -is. not essential.. Proyided that; the-/ 
spatial, location of., each probe in an ^rray xs known., - the data 
- from the ^probes ..^ tp.^yield; ^the 

sequence of a ^ target irrespective of the physical, .arrangement 
of the probes on a chip. In processing the data,. the . 
hybridization signals from the respective probes, can be. 
reasserted.. into„ any. conceptual .array, desired for subsequent 
data . reduction., whatev^er the physical, arrangement, of probes on. 

the. chip. . , ^ . - - . V *. . 

..A range of lengths. of probes can be employed in the 
chips,. As noted above, a probe may consist exclusively of a 
complementary segments, , or may have one or more complementary 
segments juxtaposed by flanking, trailing .and/or^ intervening 
.segments.,. In the. latter . situation, the total, length of j - 
, complementary segment.(s) ..is .more important that the length of 
the, probe. In functional terms,, the complementarity , , 
segment (s) of the first probe sets. should . be suff iGiently long 
to allow, the. probe, to. hybridize detectably more .strongly to a 
. reference - sequence, compared, with a., variant of the reference 
.including L a single base mutation^at the. nucleotide^ 
.corresponding to , the interrogation, position of^. the probe . 
.Similarly,, the. complementarity, segment (s)^ in. .corresponding 
.probes., f rom, additional* probe sets should, be suff iciently long 
. to..allow,,a . probe.. to., hybridize de.tectably^ ^P^.^ ^'^^prigly .to a 
variant of the reference sequence having a single nucleotide 
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substitution at 'the interrogation' position relative to *the 
reference sequence. A probe -usually has a single ■ 
complementary segment having a length of at least 
3 nucleotides , and more usually at least - 5 , '6; • 7 / • 8 , 9 , 10 , 
11, 12, 13/ 14',- 15, 16, 17v 18, 19,- 20,- 21, 22, 23, 24,* 25 or 
'30' bases exhibiting perfect complementarity (other- than " 
possibly at the interrogation position (s) depending on the 
probe- set) to the reference sequence. In bridging- strategies, 
where more than one segment' of complementarity is present, 
each segment provides at least three complementary nucleotides 
to the reference sequence and the combined segments provide at 
least two segments of three or a total of- six complementary 
nucleotides. 'As in the other strategies/ the combined length 
of ^-complementary segments is typically from 6-3 0 nucleotides, 
and preferably from about 9-21 nucleotides. - The two'- segments • 
are often approximately the same length.' Often; the probes 
(or segment of complementarity within probes) have" an* odd 
number of bases," so that an interrogation position can occur 

in the" exact center of the probe: ' ' . . . . .. . j. 

' ' • • ^ In some chips,- all* probes are the^- same' rength; Other 
chips employ different groups of probe sets, in' Which'^case the 
probes are of the same size within a group, but differ between 
different groups- - For ■ example , some chips have one group 
comprising four sets .of probes as described " above in" which all 
the -probes are 11 mers; together with a" second group- 
comprising four sets' of probes in* which alT of- the probes are 
13- mers . Of course , additional ' groups- of probes ' can ' be added . 
Thus, some- chips contain,' e.g. , ■ four groups of probes having 
sizes of -11 mers >• 13 mers,. 15 mers and 17 mers Other" chips 
have different size probes within the same' group of four probe' 
sets. -'--In these chips,' the probes in* the 'f irsf set can vary in 
length'- independently • of each' other. " Probes *-in ' the' other sets 
are usually the- same- length as the probe occupy ing the ' same 
column from the first set. However, occasionally different 
lengths- of - probes can be- included at the 'same' column* position ' 
in-'the' four lanes;- - The different- length" probes are^ included 
•to equalize hybridization signals - from *• probes irrespective of 
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. Whether, :AtT or .C-G -bonds:, are, f ormed , at , the interrogation, • 

position.. ■ ■ ■ ■■■ ■ "■- " • - ". ■■ • ■ ' • 

The length of . probe „can',be important .. .in distinguishing 

between. .a ..perf ectly, matched probe and . probes showing a single- 
base mismatch. with. the target sequence. . .The discrimination is 
usually .greater for. short probes. Shorter probes . are usually 
also less susceptible to formation of secondary .structures . 
However, .the absolute . amount of target sequence bound , and 
hence the signal,-, is greater .for larger probes ... .The. probe 
length representing the .optimuro compromise.. between_,these , ■ . 
competing considerations may vary depending on. iJiter alia the 
GC . content of a .particular region of the .target DNA sequence, 
secondary. ..s.tructure ,. synthesis ef f iciency and cross- , 
.hybridization. , .In some regions of -the target ,.. depending, on 
hybridization conditions, .short probes (e . g ,,. 11 .•mers) , may 
. provide .:lnf.ormat:ipn,.th^t is iriaccessi.ble.;.from . longer ..probes 
(e.g., 19 mers) and vice versa.. Maximum . sequence, information 
can be read by including several groups of different sized 
. probes on the chip as . noted above. However, for .many regions 
. of^the target sequence,, such a stra-tegy, provides redundant 
.Anformation. .in that the same sequence as. .read multiple times 
from the different groups of probes. . Equivalent information 
.can.be obtaine.d from a single group of different -sized, probes 
in which. the sizes are selected to maximize. readable sequence 
at particular regions of the target. sequence.. The appropriate 
size of ■probes at different regions of. the -target sequence can 
be de.ter.mined from., .e - g .. , Fig. 12 , . which compares ,-the .. 
rea.da.bility .of diff.erent -sized probes . in different regions of 
a target The .strategy of . customizing probe length within a. 
sing le group of -probe sets .^minimi z es, the ...tota 1. number of 
probes required to read . a particular . target sequence . . T.his 
leayes.-.ample capacity -.f or the chip .to . include .. probes, tp. other 
r.eference r.se.que.nces.. .. ■ ■ . . . .-. ■ ... ' . - 

.. . , Th.e invention provides an optimization -.block .which, allows 
systematic, variation . of probe .length , and interrogation , 
posit:ion.:.to .optiniize .-the .selection of probes .for -.analyzing a 
particular.-nucleot.i.de .in a ...reference 'Sequence. : ..The ..block 
comprises ■alternating ^columns of probes, complementary to . the 
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wildtype target and probes -complementary: to -a specific . 
mutation. The interrogation position is varied between 
columns and probe length ■ is - varied down a column. , 

- ^ Hybridization of the chip to the reference sequence or the 
5 mutant form of the reference sequence ident if ies -^the probe 

- length and - interrogation position providing the .greatest:.' 
differential hybridization • signal .* 

The probes. are designed to be complementary to either 
strand of the reference sequence (e.g.-,- coding or non-coding). 

10 Some chips contain separate groups of probes, -one . . . 

complementary tothe coding strand/ the other -complementary to 
the -noncoding -strand. Independent analysis of coding , and 
noncoding strands provides largely redundant inf ormation^. 
However the regions of ambiguity in reading the coding, strand 

15 are not-always -the same as those in reading the noncoding- 

strand. Thus,- combination of the -information from coding' and 
noncoding strands -increases the overall accuracy of 

sequencing - - ^ 

' Some -chips contain additional probes ■ or -groups of probes 

20 designed to be complementary to a second reference sequenceV' 
The- second- reference ^sequence is often a subsequence of -the 
first - reference sequence bearing one or -more commonly. - - 
occurring -mutations or interstrain variations .- The second ' ' 

- group - of probes is designed by the same principles, as- ^ 

25 -described above " except that the- probes exhibit -complementarity 
to the- second reference sequence. —The inclusion of a second 
group- is particular useful for analyzing short subsequences of 
the primary refereince sequence in which -multiple mutations are 
expected to occur -within' a short distance commensurate- with 

30 the length of- the probes (i.e. , two or more mutations within 9 
to 21-- bases) . Of -course, the same ^ principle :can -be -extended 
to provide chips containing groups -of probes f or - any-. ^ number of 
reference sequences. Alternatively, the chips may -contain 
additional probe (s) -that do -not form part of a tiled. array as 

35 noted - above, ■ but rather serves - as probe(s)"f;or a -conventional 
. reverse - dot -blot - -For example the ■ presence -of., mutation* can 
be detected from ' binding - of -a - target - sequence to a.^single. 
" "oligomeric-'probe -harboring the mutation. - Preferably., . an- . 
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additiona.l.prpbe containing,the, .eqvi,ivalen^ r.egipn .of .the. 
wildtype sequence is. included as a; control-. ■ r 

.■ The,- chips are read by comparing the intensities of 
labelled -target bound, to the .probes- in an- array. - 
specifically; . a- .comparisbn. is performed, between each lane of . 
probes (e.g., A, C, and T lanes) at each- columnar, position 
' (physical or conceptual),. For -a particular columnar position, 
the lane showing. -the greatest hybridization signal, is. called 
as .the.nucleotide present at the., position in the target ., 
sequence. corresponding to -the interrogation position in the ■ 
probes.- . See Fig. 5.. The corresponding position in the, target 
sequende is that aligned with .the interrogation position in 
corresponding .probes when- the probes and target are. aligned to 
maximize compleroentarity . Of .the four probes in a column, 
only one can exhibit, a- perfect match to. the target sequence .. . 
i whereas .;the.. others ..usually exhibit, at least .,a ^ne. .base- pair 
mismatch. The probe exhibiting a. perfect match .usually-.^,, 
produces a substantially greater hybridization .signal. than the 
- other three probes in the column and is thereby, easily . - 
id.entified.- . Howeyer,.. in some regions of..,the target .sequence, 
. the .distinction between a .perfect match and a .one-base ... 
"mismatch is less clear. Thus, a call ratio is.established to 
define the ratio of signal from the best hybridizing probes to 
the- second best hybridizing probe that must be exceeded for a 
25 particular target position -to be read from the probes. , A high 
call .ratio ensures that few if any errors are made .in calling 
target nucleotides, ., but can .result in. some nucleotides .being 
scored-. as .ambiguous, which could in fact be accurately read. 
A lower, call Iratiq results in fewer ambiguous calls, but can 
3Cir . result in more erroneous -calls .a 
• .. cail ratio of ' 1-. 2 virtually all calls are accurate: However, 
a smair but significant number of - bases (e.g.,,- up to, about 
. .10%), may have to. be scored- as ambiguous. . - ., . - 

.„• -. -.Although small regions of the target sequence -can , 
35 sometimes .be: ambiguous , these regions usually- occur at .the 

- same or similar- segments - in dif f erent, target secjuences ... Thus , 
. ' . f or-precharacterized .mutations, it. is known in ..advance whether 
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that mutation^ is 'likely ' to -^occur- -within a^ region^of, - 
unambiguously determinable sequence. ' ' ' . . . • 

An array of probes -is most useful for analyzing the 
reference sequence from- which the- probes^ were' designed and- 
5 variants of that sequence^ exhibiting substantial sequence ^' 
similarity with' the reference sequence -(e.g.-, several single- 
base mutants spaced over the reference sequence) . - When-' an 
array is used to analyze the exact reference sequence from 
which it was 'designed,* one- probe 'exhibits • a 'perfect match to 
10 ' the reference sequence*; and the other .three-' probes' in^ the same- 
column exhibits single-base mismatches. Thus discrimination 
" between* hybridization signals is usually high and accurate 
sequence is obtained. High accuracy is- also obtained when an 
array is used for- analyzing a target- sequence' comprising-' a 
15 variant of' the' ref erence sequence that has a single- mutation 
relative to the reference sequence,- or several widely spaced 
mutations relative to the reference sequence. • At different 
mutant *loci,^ one -probe exhibits a perfect match • to the target, 
and the other three probes occupying the same column exhibit 
20 'single-base mismatches,- the difference (with respect- to - " 
analysis *^^of the reference sequence)^ being the -lane in which 
• the perfect match occurs.- . .. - i . / . ... , 

^ ^ ' For'target sequences showing a high degree of divergence 
■ from the reference strain or- incorporating several- closely"' 
25 spaced mutations from the reference strain, 'a single group of-*- 
^probes (i.e., designed with respect to a single reference 
sequence) will -'not always provide accura'te sequence for the 
highly variant region of this sequence.^ - At some -part-icular 
columnar positions,* it- may be that no single -probe exhibits 
30 perfect^ complementarity to the target arid that- any comparison 
must be "based on different -degrees of mismatch between the 
four probes.- Such' a comparison does hot always ■ allow the 
target nucleotide corresponding to that- columnar position to 
be called.' Deletions in target sequences' can -be^ detected by 
35 ' loss- of -signal "from probes having interrogation positions 

encompassed by the deletion ' However signal- -may also - be lost 
• -from" -probes having interrogati-on - position's- -closely p'rox to 
the deletion resulting in some regions of the target sequence 
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:that: cannot -bey read. . Target sequence; bear ing insert ions will 
also: exhibit short regions including and ;proximal to.- ;the 
insertion^ .that-usually cannot^ be .read.- . . : ^ 

The presence .of short regions of diff icult-to-read -.target 
because- of closely, spaced . mutations: -insertions or deletion,- 
■ does not; prevent determination - of the remaining sequence of 
.the ..target: as different regions of ■ a target sequence :are 
.determined:independently . Mpr^eover, such ambiguities ..as might 
result-^from /analysis- of * diverse -variants yith a single group 
,of probes can. be avoided by including .multiple groupsv-of probe 
sets on. a chip. . .. For. example, one group of probes can be- 
designed: based on a . full-length reference, sequence, and the 
. other groups on subsequences, of the .reference sequence ./ 
dncorpbrating- frequently -occurring, mutations: or strain 
variations ... ' ^ - '.^ ^. ^ *■•■■- ■ - ■, ^ . .. 

. V- :^- Ar particular advantage of -^he: , present sequencings /strategy 
over conventional sequencing methods is the capacity./^^ - 
simultaneously to.: detect and quantify proportions of . multiple 
.-.target . sequenc.esv Such capacity is. valuable, e.g.v-for 

diagnosis- of :. patients who .- are . heterozygous.: with respect to a ir 
,'gene or.-who are inf ected/with a virus , such as HIV:, - which is 
usually present in several polymorphic forms. Such capacity 
is also useful in analyzing- targets: from biopsies of tumor 
cells and surrounding tissues . The presence of multiple 
target , sequences is,' detected; from the relative signals of the 
,four :probes at .the array columns .corresponding to the: target 
nucleotides at "Which: diversity occurs. • : The relative. signals 
at the.^four probes for- the mixture under ^. test ,are compared 
with the' corresponding signals. from a homogeneous reference 
^ sequence . a ^signal f rom a probe- that is r-. - - 

mismatched with respect to the reference sequence, -and a 
corresponding decrease in the signal from the probe which is 
matched with the reference , sequence signal ■ the", presence of a 
^mutant. strain , in the mixture.: -.The . extent, in- shif t: in 
hybridization signals of . the^. probes isi.related: to .the. /. 
proportion * of a: target ' sequence in the: mixture;.^ : Shif ts:^;'in 
relatiye\hybrid'ization\signals can -be quantitatively .related 
\ to. proportions of reference: and:mutantr sequence : by. .prior. 
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■ ■ calibration "of '-the chip- with seeded mixtures, of • -the. mutant and 
reference sequences-. By' this means', a chip can be . used to 
detect variant or mutant strains- constituting as . little as 1, 
■ 5,-20-, or 25% of 'a mixture of stains 
5 .Similar principles allow the - simultaneous analysis of 

multiple target sequences, even when. none is identical • to. the 
reference sequence. For example, with a' mixture of two. target 
sequences bearing first and second, mutations there would be a 
variation in the hybridization patterns of probes:: having . 
10 ^^ interrogation, positions corresponding to the' first.'and second*, 
mutations- relative to -thevhybridization pattern with the., 
reference sequence.- ■ At' each' position , - one of . the. probes, 
having a mismatched interrogation position relative to the 
reference sequence .would show an increase in; hybridization 
15 signal, and the probe having a matched interrogation position/ 
.. . relative, to the' ref erence sequence wpuld- show , a .decrease in 
hybridization signal. Analysis of the: hybridization pattern 
- of- .the"' mixture of mutant : target", sequences , preferably in 
comparison with the hybridization pattern of the reference 
20 - sequence", -indicates the. presence: of two' mutant -target ' 

- sequences; thevposition and nature - of - the mutation in^- each ' 
strain,, and the relative proportions of : each strain.. 

In a: variation of the above method,' the different ^' 
components in-, a- mixture of target sequences are dif f erentia^ily 
25 labelded- before- being applied to- the. array. . -For example; a ' 
* . variety of fluorescent labels^ emitting at"*different 'wavelength 
are available.- ^The^use.of differential., labels allows. 
independent analysis* of different targets, bound simultaneously 
to. the ' array . /For example, the methods permit ' comparison of 
30 target sequences: obtained from-' a patient ^at different stages ' 
of a-.disease., " /■ ■ . - : • :. ..:';:..---:■•. 

-.: :•■ , V- - . 2 . ' Omission- .of - Probes : : . :• : . : .. / 

' The; general" strategy" outlined, above employs four probes 
35 to read each nucleotide of interest in; a*- target : sequence .t One 
^ probe., .(from the* first probe: set ) shows .:a perfect: match ; to the 
r.eference . sequence: and .the other three."'probes.; (-f rom^the ' 
second, .'.third- and; fourth., probe, sets.) . exhibit -a. :mismatch< with 
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the reference- sequence • and .- a: perfect, match . with-, a,, target; 
sequence- bearing a: mutation atrthe nucleotide, of- interest. 
Thecprovision.of three probes from , the: second,- third . and. 
fourth -probe r.sets allows detection of • each of;; the three ; 
possible nucleotide, substitutions of . any nucleotide - of,- . . 
interest. ■. However, ■ in -..some reference ■ sequences or, .regions of 
reference sequences,, -it is known in advance that- only.. certain 
mutations, are. likely, to, occur. Thus, for example at ; one site 
it might :,be. known that an A nucleotide- in the reference-:,, 
sequence may exist ■ as a T . mutant in some.; target, sequences but.^:. 
is unlikely to exist. as a cor G mutant. Accordingly, for 
analysis of this -region of , the reference- sequence one might 
include only the -first and,second probe sets, the first probe 
set -.exhibiting perfect .complementarity to. the -reference:.. 
. sequence and the second probe set having .an, interrogation 
.position occupied..by...an,inYari^^ the 
T- mutant) . , -In other situations, one might include, the first, 
second and third probes sets (but not the- -fourth), for 
detection of a wildtype nucleotide in the reference -sequence 
2 0 and two mutant variants thereof .- in target,,, sequences. In some. . 
-chips , probes ■ that ..would-; detect • silent- mutations.- (i .e . , not 
affecting amino acid sequence) are omitted... 
. .. ...In some chips, the probes from the first probe set- are 

omitted corresponding to some or all positions of the 
25 reference sequences. Such chips comprise at least- fwo- probe , 

sets -The -first probe ..set, has a. plurality of probes ,Each 

-probe ^comprises a segment exactly complementary .to. a. .. 
subsequence of a reference sequence -except in. at ..least one 
.-interrogation position. A -second -probe set has a - 

The .corresponding probe in the second probe set is. ..identical 
. tq...a . sequence -comprising the corresponding ■ probe . form-^the 
; probe set-,or -a subsequence -thereof .that -includes. :the at 

least one. ( and -usually ...only one) interrogation position ■ except 
that .the. at ..least, one interrogation position is -.occupied by a^ 
different' nucleotide in ieach.of the -two .-.corresponding, probes 
from .:the. :f irst .-and .-second -.probe .sets . . . ,A third.-prpbe, set,- if 
present:,-, also .comprises , -a .corresponding ^probe, for .each.^probe 
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in "''the ' first' probe- set except- at the at^ least one . v , 
interrogation position which differs in the .corresponding 
probes from the three sets. Omission of probes having a: 
segment exhibiting' perfect complementarity to the reference 
sequence results in loss of - control-^-inf ormation , i.e., -the 
detection of nucleotides in a target sequence that -are the 
same: as those in a reference sequence. However, similar*, 
information-can be obtained by hybridizing a chip: lacking 
probes ' from the first probe -set to both target and reference 
sequences. The hybridization can be performed -sequentially , 
or concurrently, if the target and reference -are ^ - 
differentially-labelled. ^In this situation, the presence of i 
mutation is detected by a shift in the -background 
hybridization intensity of the reference -sequence ^to a - 
perfectly • matched^ hybridization 'signal -of the target sequence, 
father- than by -a comparison ' of the hybridization intensities 
of probes from the 'first set with -corresponding probes from 

the second; third and fourth sets. ■ ' - 

. . f ■ , .. - ■ . ■ . • - - . 

3v -Wildtvpe Probe Lane • - •- - * . .r-.y ■ 

When -the chips comprise four probe sets>" as -discussed 

supra, and the probe * sets - are laid down in four lanes, an A^' 
lane;- a C-Iane, 'a G lane-and a T or U lane, the probe having a 
segment exhibiting ■ perfect complementarity -to a 'reference 
sequence -varies between the four lanes from one column to . 
another This does riot present any significant difficulty in 
computer- analysis of the -data' from the chip. However',- visual 

inspection of the hybridization -pattern ' of the -chip is 

sometimes facilitated by provision of an extra- lane -of probes, 
in ■which- each- probe has a segment exhibiting ^perfect - 
complementarity to the -reference • sequence . — See Fig ' 4 . ■ - ^This 
segment ■ is' identical to a segment from one -of the -probes in 
the- other four lanes (which' lane- depending on the column 
position) v The - extra • lane'- of probes (designated the wildtype 
lane) hybridizes- to a target -sequence -at all nucleotide * 
pbsitions'^ except those in which deviations -from the' reference 
sequence * occurs : The hybridization ' pattern "of:. -the>wi id-type 
lane-'- thereby provides ' a simple visual - indication of . mutations . 
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4 .- .peletiionv"- Insert ion- and Nult:iple"Mut:at:ion.: Probes 
^ ^ Some-chips provide an additional probe set specifically 
designed for -analyzing . deletion mutations. . The additional 
probe set comprises a . probe corresponding to each probe- in the 
5 first probe set as. described above.- However a probe from the 
additional, probe set differs from the corresponding/. probe in 
the .f irst; probe , set in. that the nucleotide , occupying*, the ' 
interrogation: position is deleted :in the probe from.rthe . ' 
additional probe set. See Fig. 6., Optionally, the probe from 
10 the additional -probe set: bears an- additional ; nucleotide at one: 
of its termini relative to the: corresponding probe ^-from . the 

- first probe . set. -/The probe .from the additional prober set will 
hybridize -more strongly than the corresponding probe .from the 
"first probe set . to a target sequence having a single .base 

15 , , dele tion-at- the nucleotide .corresponding -to the -interrogation^ 

only- the interrogation position ,^ but also an adjacent - 
nucleotide is detected. y - . ' 

Similarly, other chips provide additional' probe sets for 
20 . analyzing; insertions. ■ For example, one additional probe set ^ ' 

- , has- a probe^ corresponding to. each probe in the first probe set 

as^ described above . / However , the probe in the additional 
probe set has an extra ,T nucleotide inserted adjacent to the 
interrogation position. , See. Fig. 6- Optionally/ the probe 

25 has one fewer nucleotide at one of its termini relative to the 
corresponding probe from the- first probe s.et. The probe: from 
the - additional probe. set hybridizes more strongly, .than the 
corresponding^ probe ^rom the .first probe set to a.- target 
sequence having- an A nucleotide inserted in a position. 

30 ^ adjacent tOL^^ t to the interrogation-vposition. - 

Similar- additional probe sets are constructed .having^ C, G or 
T/U nucleotides inserted adjacent to the interrogation-'-' 
-position.- Usually, - four such probe' sets, one for: each 
nucleotide,; are. used^ in. combination. * ' * ' - ' " 

35 _ -other; chipS; provide . additional, probes.' (multiple-mutation 

probes): for - analyzing ■ target -sequences having multiple-, c'losely 
spaced., mutations:. , - . A; multiple-mutation probe is.' usually ■ 
identical;-.to. a. corresponding probe^" from" the- first 'set' as-'- 
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described above;, except-; in; the' base - occupying: the 
interrogation position , and- except at one or more^ additional 
positions, corresponding to nucleotides 'in which substitution 
may. occur in the reference sequence. - The one - or 'more 
additional positions in-: the :inultiple mutation probe are . - 
occupied - by • nucleotides complementary to the nucleotides 
occupying corresponding positions" in .the' reference - sequence 
when the possible substitutions have occurred. — 
5 . Block Tiling ^ v ■ . 
, : As noted in -the. discussion of the general tiling - - 
strategy a -probe in the first probe -set sometimes' has more 
than. ^one : interrogation position. - In - this situation, a probe 
in the first. probe set is sometimes matched with multiple 
groups of at least one, and usually; three additional probe 
sets.;,*:See Tig. -7.; Three additional probe '-sets' are -used' to 
. allow .'detection of the three possible nucleotide substitutions 
at any : one 'position:. If only - certain types of substitution 
are likely to occur (e.g., transitions), only one or two • 
additional: probe sets are required - (analogous to the use of- 
probes; in . the* .:basic tiling strategy ) . "-^To ^ illu*s*trate"'f or ' the 
situation where a group comprises three additional' probe 'sets , 
a .-first such group comprises second, third and fourth probe- 
sets,, each ;of which has a probe corresponding to -each probe""' in 
the first probe set. The corresponding probes from the 
second, .:third and fourth probes -set's differ from the ■ 
corresponding probe in. the' first set at* a first of'the- 
interrogation positrons. ' Thus, the relative hybridization 
signals- from corresponding -probes from the first, second;' 
third and fourth: probe- sets -indicate - the identity of -the - • 
nucleotide in a target . sequence corresponding' to- the- first 
interrogation position.: A second group of three- probe - sets 
(designated- fifth,, sixth and seventh ' probe sets) ; ^ each also 
have ..a probe corresponding to each probe in the first probe 
set. These corresponding, probes differ* from that in the ' first 
prpbe set; at a second, interrogation ' position . - - The relative 
hybridization . signals - from:, corresponding, probes' from 'the ' 
first}, -.fifth; sixth, and- seventh probe sets- ■indica:te -'the 
identity of ; the,- nucleotide -in the:, target -sequence-:^ *; 
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-corresponding :-to the/ second- interrogation .position..; As. noted 
above, the probes ^in the first. probe set of ten have seven or 
more: interrogation positions. ; If there are seven . - .y- 
interrogation. .positions, .there -are -seven ^groups of .-three . 
additional probe sets,. -each group of three probe . sets serving / 
to identify the nucleotide corresponding ^tOr one ^ of -the seven 

interrogation positions- * , - . - : :-r ; : / 

..Each block, of probes allows short regions of a, -target 
sequence to be read-. - For example for a block of ..probes 
having- seven interrogation positions, seven nucleotides, in the; 
target sequence can. be read. ^ Of course, a chip can contain 
. any number, of blocks depending on how many, nucleotides of the 
target are of interest . ^ The hybridization^ signals; for each 
block -can be analyzed Independently, of any other .block.,; The 
block tiling, strategy can also .be. -combined .with other; tildng / 
-.^.strategies.,^w;ith..:.difXerent parts^^^ , 
sequence being tiled by different strategries. . 

The block tiling . strategy of fers two- advantages-, over the 
basic strategy in which each probe- in the first set. has a 
single, interrogation position . One .advantage is^~.that;,the same 
s^e,quence information can be obtained ,f rom ;fewer, --probes . ^, A 
second advantage., is, that each of the probes .constituting a 
block., (i..e • , a probe from the first probe set and a 
corresponding probe from each of the other probe sets) . can 
have identical 3 • and 5 • sequences, . with the variation ■ - 
conf ined- to . a central segment^ containing^ the interrogation 
positions.'.. The identity, of 3. V. , sequence between different, 
probes simplifies the strategy for solid phase- synthesis, of 
the probes on the chip and results in more uniform deposition 
of .the dif f erent probes on the-chipv thereby in turn- - 
increasing the ..uniformity of signal to noise ratio, .fpr - 
different regions of the^chip. A third ■ advantage- is . that • 
greater signal uniformity is. achieved -within • a- block.. - 

, . . . . 6. Multiplex Tiling - . ^ --^ , . ■ / 

Jn-the-block tiling : strategy discussed .above , ther - 
identity.,of - a -nucleotide in ; a... target or^: ref erence sequence is 
determined .by comparison- of hybridization .patterns . of,-, one- 
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probe having - a 'Segment - showing -a '^perfect match with that of 

other probes (usually three other probes) showing a single 

base mismatch. -In multiplex tiling, the identity of at least 

two nucleotides in a reference or target -sequence - is • 

determined -by comparison of hybridization signal intensities ' 

of four probes ; 'two of which have a segment -showing perfect 

complementarity or a single base mismatch ■ to the reference 

sequence, and two of which have a- segment showing perfect 

complementarity, or a double-base mismatch to a segment. -'The 

four-probes whose hybridization patterns- are to be' compared 

each have a segment that is exactly complementary to a- - 

reference sequence • except -at two interrogation positions/ in 

which "the segment may or may not' be complementary ' to" the- - 

reference sequence. The interrogation positions correspond to 

the nucleotides in a reference or target sequence which' are ^ 

determined -by the comparison of intensities. The* nucleotides 

occupying the interrogation positions in the four probes' are 

selected according to the following rule. The first 

interrogation position- is occupied by a different nucleoti^ie 

in -each Of -the four probes. -The second -interrogation- position" 

'is also occupied by a different nucleotide ' In' each "^of ' the four 

probes.' - In two of the four probes designated the first and 

second probes/ the segment is exactly complementary to the'' 
^ . . .... 

reference sequence" except at not more- than" one of the two 

interrogation positions. ~In other words, one* of the-- ' 

interrogation positions' -is occupied by a- nucleotide that Is 

complementary- to the- corresponding nucleotide' from the " 

reference, sequisnce-' and the other interrogation position may or 

• may- not be' so occupied-. In the other- two of the'' four 'probes , 

designafed the third and fourth probes; the" segment is exactly 

complementary to- the' reference sequence except* that 'both'"' 

interrogation positions are occupied ' by - nucleotides which' are 

noncomplementary "to the respective- corresponding nucleotides 

in the reference sequence - 

There are number of ways * of satisfying these conditions 

depending- on-' whether- the two' nucleotides in' the" reference 

sequence correspondihg to the'^t'wo Interrogation '^posxtions "are 

the -same or* d-if-feirent. ^ If these-' two.' nucleotides- -are different 
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in the reference sequence (probability 3/4) the conditions 
are satisfied, by. each of^ the two interrogation positions being 
occupied by the . same nucleotide in any given probe. ; For 
example,, in,, the f irst, probe , - .the ;two interrogation positions 
5 would both be A, in.. the- second probe, both would he C, in the 
third -probe/ each would be- G,^ and in the fourth probe - each 
would be, T or U.- ,If , the- two nucleotides in the reference 
sequence correspondingto the two interrogation ^positions are 
different/, the conditions noted above are satisfied by, each of 

10 the interrogation -positions in any one of the four probes 

being occupied by complementary nucleotides. For example, in 
the first probe, the interrogation positions, could/ be. occupied 
by. A and T, in the second probe by C and G, in the third probe 
\hy G and C and ..in the'- four probe,, by T and A . See ;(Fdg.. 8) . 

15 _ , ^ When. the. four probes are hybridized;. to a target that is 

„ • .the .same. ^as.^the xefer^ sequence^ or. dif f ers^^^ 

reference . sequence at . one (but :not both) of the - interrogation 
positions,, two of .the four .probes show a double-mismatch, with 
the target, and .two probes ..show, a single, mismatch, The = ^ 

20 identity of .probes showing these different degrees . of;- mismatch: 
can .be .determined from .the . different, .hybridization signals . \ 
Fxom the .identity of the probes,, showing - the - different degrees 
of mismatch, . the nucleotides occupying - both of the- . - . ^ 
interrogation positions in .the target .sequence can be deduced. 

25 For. ease of illustration, .the . multiplex strategy . has been 

initially described for . the situation where., there. .are two 
nucleo;tides of interest in a. reference .sequence and only- four 
probes_in an . array -/- Pf course, the strategy can be extended 
to analyze any number of . nucleotides in a target sequence by 

30 '^ ^ using additional probes . In one variation, each -pair-of-- 

interrogation, positions is read, from a unique group, of four . 
probes.., In a block . variation , different groups . of ■ four -probes 
exhibit the ^.same . segment of complementarity with : the reference 
sequence, but the interrogation positions move within - a block . . 

35 The block and ..standard multiplex . tiling .variants ^ can of course 
be used in combination .. for. .different - regions .of a reference 
sequence. Either or both variants can also - be - used- in • 
combination with any of the other tiling strategies described. 
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■'^ - 7 Helper Mutations ■ - . . 

■Occasionally STnall regions of a reference sequence give 
low hybridization signal as a result of annealing of probes. 
The self -annealing reduces- the amount- of probe effectively 
available for hybridizing to the target. Although- such " 
regions of -the target are - generally- small and the* reduction *o 
hybridization signal -is usually- not- so substantial- as to 
obscure -the* sequence of this region, this concern can- be- 
avoided -by the use of probes incorporating helper mutations. 
The' helper mutation(s) serve to break-up- regions ■ of internal 
complementarity' within- a probe and thereby prevent "annealing . 
Usually,' one or two helper- mutations are quite- sufficient for 
this purpose. The inclusion of helper mutations can be * 
beneficial in any of the tiling strategies noted above. - In 
'general each- probe having a particular interrogation position 
has-the. same^ helper mutation(s) Thus, such probes have a 
segment in 'common which shows perfect complementarity with a 
reference' sequence , except that the segment contains at", least 
one-helper mutation (the same in each* of the* probes) arid at 
least one interrogation position (different in ~all of the 
probes);.-*- - For example,' in- the- basic tiling strategy, a, probe 
from- the ■ first probe- set comprises a segment containing^' an 
interrogation position' and showing perfect complementarity 
with a reference sequence except for one or two helper"' ' 
mutations/ • The • corresponding probes from 'the second, third 
and fourth probe sets'- usually comprise the same .segment (or 
sometimes a'subsequence thereof Including the helper ' ' " 
mutation(s) and interrogation position) , except that* the "base 
occupying the interrogation position varies in each probe. 
See FigT 9 . - : . - ^ . . . . 

Usually, the helper, mutation tiling strategy as used in 
conjunction with one ' of - the ^tiling strategies described/above 
The probes- containing -helper mutations are used to tile'^ 
regions -of a reference sequence otherwise giving low ' " 
hybridization- signal (e.g-, because of self-complementarity) , 
and the alternative -tiling strategy is used to' tile " ' ' 
intervening regionsl ■ - . . . . .■ ...v.. 
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R. Pooling st :rateaies ; .' ■ ' 

Pooling strategies also employ arrays of immobilized 
probes. Probes are immobilized in. cells of an- array.,.' and the 
hybridization signal of - each." cell can be determined 
independently of any other cell." A particular cell may be 
occupied by pooled mixture of probes.- - Although the- identity 
•of eachV probe in the mixture is known, the individual probes 
in the pool are not separately addressable. Thus, the 
hybridization signal from a cell is. the aggregate of that of 
the different probes occupying the cell In general , a cell 
is scored as hybridizing to a target sequence if at least one 
probe occupying the cell comprises a segment exhibiting 
perfect complementarity to the target sequence. 

A simple strategy to show the increas.ed power of pooled 
strategies over a standard tiling is to create- three, cells 
• each'- containing a pooled probe^having.-a singler pooled i; /. 
position, the -pooled' position being the same in each' of -the 
pooled probes. At the pooled' position,- there are two possible 
nucleotide, allowing the pooled probe to hybridize to two 
target secjuences." In tiling terminology, the pooled position' 
of each probe, is an interrogation position. As will- become 
apparent, comparison ; of the hybridization intensities of the. 
pooled probes from the three cells reveals the identity of the: 
nucleotide in the target sequence corresponding to the 
interrogation position (i.e., that is matched with the 
interroga.tion- position- when the . target sequence -and pooled 
probes are maximally aligned for complementarity)-. ■ ; ' 

The three cells are assigned probe pools that- are - 
perfectly complementary to the target except at the. pooled 
: :p^c5Wition, . which ;isVc3:&c:uFfied by a different- pooled nucleotide 
in each probe as follows: , 
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[AC] = M, [GT]=K, [AG]=R Z:.: -^. - -■ . 

as- substitutions, in the' probe " . . . 

lUPAC standard ambiguity notation) ' 

'.' X - intisrrogation position 
Target: \. TAACCACTCACGGGAGCA 

Pool- r: ■ ATTGGMGAGTGCCC . 

=ATTGGaGAGTGCCC , . 

+ATTGGCGAGTGCCC 

Pool 2 : . ATTGGKGAGTGCCC 
=ATTGGgGAGTGCCC 
• +ATTGGtGAGTGCCC 

P'ooii 3 : " ' ' ATTGGRGAGTGCCC 
• " ' '"-^ ■ =ATTGGaGAGTGCCC 
+ATTGGgGAGTGCCC 



(complement to mutant 't') 
(complement to mutant 'g' ) 



(complement to mutant 'c') *' 
(complement' to. wild : type ' a * ) 



(complement to. mutant .'t'): 
(complement .to mutant , ^c\) 



20 . with 3 . pooled probes, all 4 possible , single .base pair . states 
(wild, and 3 mutants) are detected. . A pool, .hybridizes., with a 
- target., .if some , probe . contained within that pool is., . 
complementary, to., that target,. , . . „ ... . .. ... . . 



25. 



30 



Pool :' 
Target: 
Mutant : 
Mutant: 
Mutaht :' 



TAACCACTCACGGGAGCA' 
TAACCcCTCACGGGAGCA 
TAACCgCTCACGGGAGCA " 
TAACCtCTCACGGGAGCA 



.Hybridization?, „ : 

1 ' " "2' ■ '"' ^2 

-' n ' - ' y - ' n 

y ' ^ ^ 

;y n . • y 
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.-A CjBll containing a pair (or more) of oligonucleotides 
lights . up when a target complementary to any, of the . 
oligonucleotide in the. cell . is . present . ..Using the simple 
strategy, each of the four possible targets (wild^ and three 
mutants) yields a unique hybridization. pattern among the three 

cells . , ... -. . . . 

Since a different pattern of. hybridizing , popls_ is 
obtained for each possible nucleotide in the target sequence 
corresponding to the pooled interrogation position in the 
probes, the identity of the nucleotide can be determined from 
the hybridization pattern of the pools. Whereas, a standard 
tiling requires four cells to detect and identify the possible 
single-base substitutions at one location, this simple pooled 
strategy only requires three cells. 
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-: . • A-.more .ef ficient- polling, strategy. f.p.r- sequence analysis 
is the 'Trellis' strategy. In this strategy,- each- pooled 
. -probe. has a segment of perfect complementarity, to a reference 
, sequence except at three, pooled positions. . . One pooled 
.5 . position is an N . pool... . -.The three pooled positions - may,, or may 
^^not be, contiguous in a probe. The other, -two .pooled positions 
are; selected from . the. group of -three pools consisting ; of ,{ 1) M 
or;K, (2) R.or Y and (3) W or S , where the single. -letters are 
. .MUPAC ^standard ambiguity codes. .The sequence of- a-, pooled 
10 probe is ;thus, of the form XXXN[:(M/K) . or (R/Y) or (W/S) ] [ (M/K) 
or (R/Y) or (W/S) ]XXXXX, where XXX represents bases ^ 
complementary to the reference sequence. The three; pooled 
positions may be in .any order , and may be contiguous , or. 

, separated by intervening nucleotides. For, the two positions 
15 occupied by [ (M/K) or - (R/Y) or (W/S),],, two choices, must be . 

. = made .. . • First , , one must select o.ne, of ■ the-, .following. - three pairs 
of pooled nucleotides (1) M/K, (2) R/Y- and ( 3 ). W/S . The one 
of three pooled nucleotides selected may. be the same or-- 
. different at the two pooled positions. Second, . supposing . for 
20 example, one selects M/K -at one position, • one must then chose 
between M or K. . This choice should result in selection. of a 
. pooled nucleotide comprising a nucleotide that complements the 
corresponding nucleotide in a reference sequence, when the 
probe, and. reference sequence are maximally aligned. The same 
25 principle governs the selection between R and Y ,. .and between W 
and . S.. ' A trellis pool probe has one pooled position- with four 
possibalities,.and..two-pooledipositi.pns, each with two 
possibilities. Thus, a trellis pool probe, comprises a mixture 
of 16 (4 X 2 X 2) probes. Since each pooled position. includes 
3.0 : . .one hucleotide that-complements the corresponding, nucleotide,^, 
. from „the.- reference sequence, one of these: 16 probes has. a 
- segment .that . is the exact complement of the reference - 
sequence. -A. target sequence that is the same- as the reference 
sequence , (i.e. a wildtype target) gives a hybridi.zation 
35 signal, to each probe cell... Here , . as .in . other, tiling methods.,., 
-the segment of complementarity : should be ■ ,suf f ici.ently:. long to 
permit. specific-hybridization of a pooled. probe to- a -reference 
sequence be detected. relative to: a variant of • that- reference 
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^ sequence. - ^ -Typi'cally , ■ the segment of , complenientar ity is about 
9-21 nucleotides. - 

' ■ " A target sequence is analyzed by cbinparing hybridization 
intensities* at three pooled probes, "each having the structure 
'5 described above The • segments complementary to the reference 
sequence present in the* three- pooled" probes show some overlap. 
Sometimes the segments are identical (other than at the.- 
interrogation positions)." However', this need not be the case. 
For example the segments can tile across a -reference . sequence 
10 in increments of one nucleotide (i.e., one pooled-'probe 

.differs from- the next by the - acquisition of one nucleotide at 
the- 5 '-end -and loss of a nucleotide at- the 3' end)-. • 'The three 
interrogation positions may or may not' occur at" the same 
relative positions within each pooled probe (i.e., -spacing 
15 ' ' from- a- 'probe' terminus) . * All that is ' required is- that - one of 
• * the'-three interrogation positions from each of the three 

pooled probes aligns with the same nucleotide -in the reference 

sequ'ehce, and that this' interrogation^ position is occupied by 

ft . 

a different pooled nucleotide in each "of the three probes. In 

20 ^ one- of the three probes, the interrogation' position -is - 
occupied by an N. - In the other two pooled probes^'the 
interrogation position" is occupied by one of (M/K) or(R/Y) or 

(W/Sj v - * . ; • . , . ' 

In the simplest form of the trellis strategy,- three 

25 pooled probes 'are used to analyze a single nucleotide in the*' 
* reference sequence'. -Much' greater economy- of probes is 
achieved when more pooled probes are included in- an array. 
For example, consider an array of five, pooled - probes --each 
having the general structure outlined above . . Three of these 

30 • pooled probes- have an interrogation position that, aligns with • 
-the same "nucleotide in the reference sequence and are used to 
read that nucleotide^ A different combination of "three' probes 
have an interrogation posit ion that- aligns with-a different 
nucleotide in the reference- sequence . Comparison of *. these 

35 ' three probe^ intensities allows analysis of- this second 

nucleotide'. Still another combination of three -pooled probes 
■ from- the set' of- five- have an: interrogation position -that 

aligns* with * a third nucleotide in the ' reference sequence and 
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^^333 . probes , are ^used 5to, analyze. that nucle :Thu^>, three 

nucleotides' in the reference sequence are fully analyzed from 
only five pooled probes . . By ' coiuparison/, the basic -tiling 
strategy would require 12. probes for . a- similar anarlysis. 
: . ....Asvan. example, a pooled probe for. analysis of a target 
sequence by the trellis strategy^ is shown, below: , ; 

Target: : ATTAACGACTCAGGGGAGCTCT / 

.Pool:: . :;c : ^ TGGTGNKYGCCCT : : ! - >^ • - - - -tJ^ 

The pooled probe actually comprises 16 individual- probes : 

TGGTGAGcGCCCT / . . , „ : - 

+TGGtTGcGcGCCCT - - - 

, . . +TGGTGgGcGCCCT ^ - n . : 

+TGGTGtGcGCCCT . ^ . ;^ 

+TGGTGAtcGCCCT 
- . +TGGTGctcGCCCT_^ - ; \ 

+TGGTGgtcGCCCT : ,, . 

+TGGTGttcGCCCT ^ 
; +TGGTGAGTGCCCT . - - - - - - ; 

+TGGTGCGTGCCCT . _ ; 

4-TGGTGgGTGCCCT \ 

+TGGTGtGTGCCCT. ^ ; >. . / ^ : : . ... . 

+TGGTGAtTGCCCT 

+TGGTGctTGCCCT 

+TGGTGgtTGCCCT . 

+TGGTGttTGCCCT 

The trellis strategy employs an array of probes having at 
least three cells/ each of which is occupied by a pooled probe 
as described above. 

Consider the use of three such pooled probes for 
analyzing a target sequence, of which one position may contain 
any single base substitution to the reference sequence (i. e, 
there are four possible target sequences to be distinguished) . 
Three cells are occupied by pooled probes having a pooled 
interrogation position corresponding to the position of 
possible substitution in the target sequence, one cell with an 
'N', one cell with one of TM' or 'K', and one cell with 'R' or 
•Y*. An interrogation position corresponds to a. nucleotide in 
the target sequence if it aligns adjacent with, that nucleotide 
when the probe and target sequence are aligned to maximize 
complementarity. Note that although each of the pooled 
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probes- has two other pooled positions, these positions are not 
relevant for the present ■ illustration . ^ The positions are only 
relevant-wheh more than one position in the - target^ sequence is 
to be read, a- circumstance - that will be . considered • later . For 
present purposes^ the cell with the 'N' in the interrogation 
position lights up for the wildtype sequence and any of the 
three single base substitutions of the target sequence. The 
cell with M/K in the interrogation position lights up- for the 
wildtype sequence -and one of the single-base substitutions. 
The cell with R/Y in the interrogation position lights up for 
the wildtype sequence ' and a . second, of . the single-base - 
substitutions. Thus,, the four possible target sequences 
hybridize to the three pools of probes in four distinct 
patterns, and the four possible target- sequences can be 
distinguished. 1. 

To illustrate further, consider four -possible target 
sequences (differing at a single position) and. a pooled probe 
having three pooled positions, N, K and .Y with the Y position 
as the interrogation position (i .e aligned . With the variable 
position in the target sequence) : .* ; . v- v ' 
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Targetr ■ \ ■ \" T 

ATTAACCACTCAGGGGAGCTCT . (w) . 
ATTAACCACTCcCGGGAGCTCT (c) ^ " 
ATTAACCACTCgCGGGAGCTCT (g). 
ATTAACCACTCtCGGGAGCTCT (t).-;_ 

TGGTGNKYGCCCT (pooled probe). ■ 

The sixteen individual component prpbes of the pooled probe 
hybridize to the four possible target secjuences as follows: 







TARGET 






w 


c 


g 


' t 


TGGTGAGcGCCGT 




n . 


' y. ^ 


n 


TGGTGcGcGCCCT 


n 


n 


n 


n 


TGGTGgGcGCCCT 


n 


n 


n 


n 


TGGTGtGcGGCCT 




n 


n 


n 


TGGTGAtcGCCCT 


n 


n 


n 


. ■ ^- 


TGGTGctcGCCCT 


n 


n 


n 


n 


TGGTGgtcGCCCT ■ 


- n ■ ' 


n 


n 


n 


TGGTGttcGCGCT 


n 


n 


n _ 


n 


TGGTGAGTGCCGT ~^ 


• y ■ ' 


n ' 


^ n 


n 


TGGTGcGTGGGGT 


n 


n 


n 


^ n 


TGGTGgGTGGGGT 


n 


n 


. n 


. - ^ 


TGGTGtGTGCGCT 


n 


■ "'h ■ 




' n 


TGGTGAtTGGGCT 


- n 




n 


"/ n 


TGGTGctTGGGGT 


n 


n 


n 


n 


TGGTGgtTGGGCT 


n 


n 


n 


n 


-TGGTGttTGGGCT - 


-n - 


n 


n 


n 



The "pooled probe hybridizes ' according to the aggregate of its 
components: 

Pool: TGGTGNKYGCGCT Y n y n 

Thus /'as stated above, it can be seen that a pooled probe 
having 'a y at the interrogation position hybridizes to the 
wildtype target and one of the mutants . Similar tables ^can be 
drawn to illustrate the hybridization patterns of - probe/pools^ 
having other pooled nucleotides at the interrogation position . 

The above, strategy of using pooled probes to, analyze a 
single base in a target -sequence can readily be' extended to ^ 
analyze any number of bases. At this point, the purpose ox 
including^ three pooled positions .within each probe will become 
apparent, in "^the' exaifiple that follows,' ten pools of probes, 
each containing three pooled probe pb^sitions , can be' used to 
analyze a each of a contiguous sequence of eight nucleotides 
in a target sequence. 



Wild: 
Mutants : 
Mutants : 
Mutants : 
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ATTAACCACTCACGGGAGCTCT- ^'Reference sequence 
' Readable nucleotides 

Pools : . 



4 TAATTNKYGAGTG - 

5 AATTGNKRAGTGC" 

6 ATTGGNKRGTGCC 

7 TTGGTNMRTGCCC- 

8 * TGGTGNKYGCGCT 

9 GGTGANKRCCCTC 

10 GTGAGNKYCCTCG 

11 * TGAGTNMYCTCGA 

12 GAGTGNMYTCGAG 

13 AGTGCNMYCGAGA. 



In this example, * the different pooled^ probes tile across 
the reference sequence, each pooled probe ..differing from the 
next by increments of one nucleotide; For , each of the . 
readable nucleotides ±n the reference sequence, there are 
three probe pools having a pooled interrogation position 
aligned with the readable nucleotide. , For example, the 12th 
nucleotide from the left in the reference sequence is aligned 
with pooled interrogation positions in pooled probes 8, 9, and ^ 
10. Comparison of the _ hybridization intensities of these 
pooled probes reveals the identity of the nucleotide occupying 
position 12 in a target sequence. 









Pools 






Targets 


' B 


- 9 


10 


Wild: 


ATTAACCACTCACGGGAGCTCT — - , - 


Y 


,-Y • 


Y- 


Mutants : 


ATTAACCACTCcCGGGAGCTCT 


N 


Y 


Y 


Mutants : 


ATTAACCACTCgCGGGAGCTCT 


Y 


N . ^ 


Y 


Mutants : 


ATTAACCACTCtCGGGAGCTCT 


• , . N ■ 


• N 


Y 



Example- Intensities: 





= lit cell ^ 


wild ' 












'= blank cell 


' C • ' 




























* T ' 














None 









Thus, for example, if pools S, 9 and 10 all light up, one 
knows the target sequence is wildtype, If pools, 9 and 10 
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- - light up;^ the . target^ sequence- has^/a-. C, mutant/ .at---posit:ion 12 . 
If pools 8 and 10 light up, the target -sequence^ has; a G- mutant 
at position .12. If^only.pool 10 lights up, the target 
sequence has a t mutant at position 12. ' / 
5 The identity of other nucleotides, in the target sequence 

is determined by a comparison of other sets; of. three pooled 
probes. For example, the identity of the 13th nucleotide in 
the,target sequence -is -determined -by. comparing the ^ 
hybridization ;patterns of the probe pools , designated :9 ; . lO and 
10 11- .similarly, the- identity of the 14thvnucleotide in. the 

target s-quence is.\determined by .comparing the hybridization 
patterns of the probe pools designated. 10 , 11, and 12. 

. In the, above example, successive probes tile acrpss the . 
reference sequence in increments of one nucleotide and each 
15 probe has three interrogation positions . occupying, the -same 

- positions .dn. each probe relative .to the terminus , of .^the probe 
(i.e., the 7, 8 and. 9th positions relative to the, 3.' .^ : 
terminus) • However, the trellis strategy does not„require 
that probes tile in increments of one or that the 
20 interrogat ion position, ppsitions occur in the same, .position in 
each- probe.; In a^ variant of- -trellis- tiling referred; to .as 
"loop" tiling, a nucleotide of interest in a -target sequence 
is read by comparison of pooled probes, which each have a 
pooled interrogation position corresponding to the nucleotide 
25 of interest, , but in which the spacing pf the interrogation 
position in the probe differs from probe to probe,: . . 
Analogously to the block tiling approach, this allows several 
nucleotides to be read from a target sequence from a 
collection of probes that are identical except at the 
^30/ interrogation position. The- identity in sequence of probes, . 
particularly at their 3' termini , simplifies synthesis of the 
array and result in more uniform probe density per cell. 

To illustrate the loop strategy, consider a reference 
sequence of which the 4 , 5 , 6 , 7 and 8th nucleotides (from the 
35 3* termini are to be read. All of the four possible 

nucleotides at each of these positions can be read from 
comparison of hybridization intensities of five pooled probes . 
Note that the pooled positions in the probes are different 
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"('f or- example ^in- probe- ' 55 V ■ the pooled positions are* 4 , 5 and 6 

and . in probe 56 , • 5 , - 6 and 7 ) . 

TAACCACTCACGGGAGCA Reference sequence • - 
55 ATTNKYGAGTGCC 
56 • ATTGNKRAGTGCC' . 

57- • ' ATTGGNKRGTGCC - . - 

58 ATTRGTNMGTGCC . 

59 ' ATTKRTGNGTGCC 

Each position of interest in the reference ..sequence -is read by 

.comparing hybridization intensities for the-three probe -pools 
that have an interrogation • position aligned -with- -the - ^^ 
nucleotide- of interest in the reference sequence 'For " " 
example to- read the fourth nucleotide -in the reference 
sequence/ 'probes 55 , 58 and 59 provide pools at the fourth 
pbsitiort. -* Similarly, to read the fifth nucleotide iri- the 
reference sequence/ probes 55/ 5 6 and 59 proVide pools' *at the 

-f i'fth -position. As in the previous trellis strategy/ one of 
the three -probes being compared has an N- at the pooled 

•.position and the other two have M or K,- and *(2) R or Y and (3) 
W or S- ■ : ^ ^ ■ - - .-• ' . . . • - . . ' 

: .The hybridization- pattern of the five pooled: probes to 
target sequences representing each possible nucleotide 
substitution at five positions in the reference sequence is 
shown- below. Each possible substitution results i'n^ a unique 

-hybridization pattern at three pooled probes, and the- identity? 
of -the -nucleotide at that position- can -be deduced- from the 
hybridization -pattern. *" - - ' — ^ 
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A - Pools 





Targets 
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W i Id* 


TAACCACTCACGGGAGCA • 
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Y , ^ 


- Y 
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TAAgCACTCACGGGAGCA 


Y ^ ' ■ 




: ■ N " 
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' :■ . N"" 




TAAtCACTCACGGGAGCA 


; Y . . 


.: N . : . 




Y 




Mutant : 


TAAaCACTCACGGGAGCA 


Y ; 




N 


' N 


Y 


Mutant: 


TAACgACTCACGGGAGCA . 


.: N • , 


, Y : 


N • 


^ N .:. . 


N.. 


Mutant I 


TAACtACTCACGGGAGCA 


N 


. Y . 


N 


N 


Y 


Mutant! 


TAACaACTCACGGGAGCA 


• Y ■ 


Y 


N ' 


n' 


- n' 


Mutant: 


TAACCCCTCAGGGGAGCA 


N 


Y 


Y 


N 


N 


Mutant*" 


TAACCgCTCACGGGAGCA 


Y ■ 


N ' 


Y 


N ' 


" ^ N . ' 


Mutant ! ' 


TAACCtCTCACGGGAGCA. 


: N . / 


V.N /• 




. . N 


. .-,N ' 


Mutant : 


TAACCAgTCACGGGAGCA ^ 


N ■ 


N - 


N • 




N 


Mutant:. 


TAACCAtTCACGGGAGCA - 




. -Y, ~ 


- N ' 


• Y 
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Mutant : 


TAACCAaTCACGGGAGCA 


N 


N 


" Y . 


. Y " 
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Mutant : ■ 


TAACCACaCACGGGAGCA 


. N . 
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-N , 
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Mutant : 


TAACCACcCACGGGAGCA 


' N ' 
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N 


Y 


Mutant :' 


TAACCACgCACGGGAGCA 


. N^' ■ 


N : ' 


' N 






Many 


variations on the loop 


and trellis 


tilings 


can 


created. 


All 'that is required is 


that 


each ■ 


position 


in . 



sequence inust have a probe with a 'N' , a probe containing one'^ 
of R/Y-, M/K or W/S/ -and a probe- containing 'a different pool 
from that set; complementary to the wild type target 'at ' that 
position/ and at least one probe with no pool at all at that 
position- This combination allows all mutations at that^ 
position to be uniquely detected and identified. - 

• A * further class of ' strategies involving . pooled ■ probes are 
terme'd coding strategies. These strategies assign code 'words 
from some set of numbers to variants of a reference 'sequence . 
Any" number of variants can be coded. The variants can include 
multiple 'closely spaced substitutions, deletions or 
insertions. The designation letters or other symbols assigned 
to each variantmay be any arbitrary set of numbers / in any 
order. For example, a binary code is. often used, but codes to 
other bases are entirely feasible. The nufribers are "often 
assigned such "that each variant has a designation having at 
least one digit' and "at least one nonzero value for that digit. 
For example, in a binary syster., a variant assigned the number 



101, has- a-'designation of three digits , with one possible 
nonzero value for each digit. 

The designation of the variants are coded .into an ;array 
of pooled probes comprising a pooled probe for each nonzero 
value of each digit in the numbers assigned 'to .the variants , 
For example, if the variants are assigned successive number in 
a -numbering system of base m, and the highest . number • assigned 
to a variant has . n digits, the array. would have about n .x- (m- 
1) pooled probes. In general log^^ (3N+1) probes are required 
to analyze all variants of N locations in a.. reference. , 
sequence, each having three possible- mutant . substitutions . 
For example,- 10 base pairs of sequence may be . analyzed with 
only 5 'pooled probes using a binary coding system. . 
Each pooled probe has a segment exactly complementary to ' the 
reference sequence except that vcertain positions are pooled. 
The segment ; should be sufficiently long to allow specific, 
hybridization of the pooled probe to the reference sequence 

.relative to a mutated form of the reference .sequence. As in 
other -tiling strategies , segments lengths of 9-21 nucleotides 
are -typical .-• Of ten the probe has no nucleotides, other, .*than 
the -9-21 • nucleotide segment. The pooled positions comprise 
nucleotides that allow the pooled probe to hybridize to every 
variant assigned a particular .nonzero value in a particular 
digit. Usually, the pooled positions further comprises, a 
nucleotide that allows -the pooled prpbe to hybridize ^to the 
reference sequence. Thus, a wildtype target . (or ; reference 
sequence) is immediately recognizable from all the pooled-. 

-probes r being lit. . *. - - - ; - 

When a target- is hybridized to the pools, only, those 
pools comprising a component . probe having. a segment that is 
exactly ■ complementary to -the target light .up. The .identity of 
the ^target is then decoded from the pattern .of .hybridizing 

pools. Each pool that lights up is correlated with a. 

particular value in a particular digit. Thus, .the .aggregate 
hybridization patterns of each lighting pool .reveal .the value 

.of .each-. digit *in the code , defining the identity of. the target 
hybridized to .the .array . , 
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-AS an example consider;- a , re-ferencer. .sequence, having . fou: 
positions, each of which can be occupied by three possible 
mutations. Thus, in total there are 4 x 3 possible variant 
forms of the reference sequence. Each variant is assigned a 
'binary- number binary, numbers 0001-1100 and the wildtype ^ 
reference sequence is* assigned the binary number 1111. * 



Positions . ... 

Target: TAAC 0=1111' 
CAGGGGAGCA 

G=0001 
T=0101 
A=1001 



X • : , " 

A=llll 

C=0010 
G=='011O 
T=1010 



X 

C=llll 

G=0011 
T-0111 
A=1011 



X - 

T=iii-f 

A=0100 
C=1000 
G=1100 



A first pooled probe; is designed by including probes ^that 
complement exactly each variant having a 1 in the first' digit. 



2 0 



25 



30 



target (1111) 
Mutant (0001) 
Mutant (0101) 
Mutant (1001) 
Mutant (0011) 
Mutant (0111) 
Mutant (1101) 



TAAC 
TAAC 
TAAC 
TAAC 
TAAC 
TAAC 
TAAC 



First pooled probe 

ATTG 
ATTG 



t ' 



C 
C 
C 



[GCAT] 
N 



A^ 

A 

A 

A 

A 

A 

A 



T 
T 



C 

c 
c 
c 



[GCAT] 



T CACGGGAGCA 
T CACGGGAGCA 
T CACGGGAGCA 
T CACGGGAGCA 
T CACGGGAGCA 
T CACGGGAGCA 
T CACGGGAGCA 



A GTGCCC 
■A GTGCCC 



35 ' 



40 



Second, third knd fourth pooled probes are then designed 
respectively including component probes that hybridize- to each 
variant having -a 1 in the second/ third and fourth digit. 
- ... . . XXXX . - ,4 positions examined . _ 



Target : 
Poor 1(1) 
Pool 2(2) 
Pool 3(4) 
Pool 4(8) 



TAACCACTCACGGGAGCA 
ATTGnTnAGTGCCG- = 
ATTGGnnAGTGCCC = 
ATTGyrydGTGCCC = 
ATTGmvmbGTGCCC = 



16 probes 

16 probes 

2 4 probes 

24 probes 



(4xix4xl) 
( 1x4x4x1) 
(2X2X2X3) 
(2x2x2x3 ) 
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The 'pdble'd- probes hybridize" to*" :varian targets as follows: 
Hybridization pattern: 
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Mutant (0101) : 
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Mutant (1001) : 


' " TAACaACTCACGGGAGCA 
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Mutant (0110) : 


• TAACCgCTCACGGGAGCA 


N 


. Y, 
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N 


Mutant- (10 10) : 


. TAACCtCTCACGGGAGCA , . 


. . N 




; r; 


Y 


Mutant (0011) : 


TAACCAgTCACGGGAGCA 


Y 






N 


Mutant-( Olll) : 


- . TAACCAtTCACGGGAGCA- 


Y 


Y 


Y 


N 


Mutant (1101) :. ; 


- TAACCAaTCACGGGAGCA ; , , 


Y 


N 


Y 


^ Y 


Mutant (0100) : 


TAACCACaCACGGGAGCA 


N 


N 


Y 


N 


Mutant (1.000) : - 


_ .TAACCACcCACGGGAGCA : ; . 


. . N - 






Y 


Mutant (1100) : 


TAACCACgCACGGGAGCA ' " ' 
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N " 


- y'" 


Y 



10 
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The identity of a variant (i.e., mutant) target is read 
25 , ^ directly from the hybridization pattern^ of the pooled probes. 
For example the mutant assigned the number OOOi' gives a . 
hybridization pattern of NNNY" with respect to" probes" 4 , 3, 2 
' and,;l respectively. : ' " y -V 

In the above example, variants are assigned successive 
3 0 numbers in a numbering system. In other embodiments, sets of 
numbers can be chosen for their, properties. If the codewords * 
, .are chosen from an error-control, code, the properties of that 
code carry over to sequence analysis. An error code is a 
numbering system, in which some designations are assigned- to 
35 , yariants and other designations serve to indicate errors that 
may , have o.ccurred in the hybridization process. For- example, 
if all codewords have an odd number of nonzero digits ('binary 
coding+error detection*)., any single error • in hybridization 
: will. ;be detected -by having an even number of pools. litV" ' 

40- , ' '"" ' ' ' ' '■]' '[ : "//^ ' ■ 

wild " . ' , ■ . ; - ' : ' : " ■ 

Target: TAACCACTCACGGGAGCA 

Pool 1(1): ATTGnAnAGTGCCC = 16 Probes (4x1x4x1) 

45 . Pool 2(2): ATTGGnnAGTGCCC = 16 Probes (1X4X4X1) 

Pool 3(4): ATTGryrhGTGCCC = 24" Probes (2X2X2X3) 

Pool 4(8): ATTGkwkvGTGCCC = 24 Probes (2X2X2X3) 
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Pool:.5 (c) : ; .ATT.GdhsmGTGCCC ^ = 


36 


probes . 




„ (2x2x3x3) 






Hybridization 


of pooled probes -to - 


'targets 
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Target ■ ' 
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Target.(lllll) 
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. . Y. 
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Mutant ( 00001 ) 
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Mutant (10101) 
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Mn 1- 3 n 1- M 1 0 01^ 
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N , . 
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Mutant (boo 10) ' 


: .TAACCcCTCACGGGAGCA 




N ^ ■ 
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N ' ■ 


N 
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Mutant (10110) 


: - TAACCgCTCACGGGAGCA 




N.. . . 
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;y . , 
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Mutant (11010) 


: TAACCtCTCACGGGAG.CA 
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Y 
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Mutant (10011) 


: TAACCAgTCACGGGAGCA , 
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Y .. 




■ N 
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20 


Mutant (00111) 


: TAACAtTCACGGGAGCA . 




Y 


Y 


Y 


N 


N 




Mutant'(OllOl) 


: TAACCAaTCACGGGAGCA 




Y 


N 


Y 


Y 


N 




Mutant (00100) 


: TAACCACaCACGGGAGCA 




N 


N 


Y 


N 


N 




Mutant (01000) 


: TAACCAcCCACGGGAGCA 




N 


N 


N 


Y 


N 


25 


Mutant (1110 0) 


: TAACCACgCACGGGAGCA 




N 


N 


Y 


Y 


Y 



9. Bridging Strategy 

Probes that contain partial, matches to two separate 

30 (i.e., non contiguous) subsequences of ^a target sequence 
sometimes hybridize strongly to the target sequence. In 
certain instances, such probes have generated stronger signals 
than probes of. the same length which are" perfect matches to ^ 
the target sequence. It is. believed (but not necessary to the 

35 invention) that this observation results from interactions of 
a single target sequence with two or more probes *; 
simultaneously.' This invention exploits this observation to 
provide arrays of probes having af least first and. second 
' segmerits; which are respectively complementary to first and 

40 second subsequences of a reference^ sequence. Optionally, the 
probes may have a third or more complementary segments.,; These 
probes can be employed in any. of. the strategies noted above. 
The two segments of such a probe can: be complementary to 
disjoint' subsequences of the reference -sequences or contiguous 

4'5" subsequences;' if - the • latter the- two - segments, in .the probe 
' are' inverted-^ relative ■ toi. the order- of- the: complement of. the - 
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reference 'sequence . ^ The* "twc -subsequences - of ^- -the ' reference 
sequence each typically comprises about 3 to 30 contiguous 
nucleotides. The subsequences of rhe reference sequence are 
sometimes separated by 0, 1, 2 or -3 bases.. Often ^ the ; 
sequences, are adjacent and nonover lapping - ... • 

For example, a wild-type probe is created by 
complementing two sections of a reference sequence (indicated 
by subscript and superscript) and reversing- their order.* The 
interrogation position is designated :(*) .and/.is apparent;, from 
comparison of the structure of the wildtype probe, with the 
three 'mutant probes. The- corresponding .nucleotide in the 
reference sequence is the "a" in the superscripted ^segment . 

Reference: , 5' T^gcta^^^^^^'^^^'^^'^^'^'^^^ • ^ /..v \ 



Probes: 3' GCTCC CCGAT (Probe from first probe set) 

3 » GCACC CCGAT 
3' GCCCC . CCGAT • . / ^ 

3.' GCGCC CCGAT - J-' 

The expected hybridizations are: 
Match: , ' ~- 



' GCTCC CCGAT . 

; ... .TGGCTACGAGGAATCATCTGTTA ^. 

GCTCC CCGAT ' * ' ' 

Mismatch: ..... . . . = . . - . :v . . ^ 

GCTCCCCGAT 

" ' . . : TGGCTACGAGGAATCATCTGTtA -' ' - - 
. * — - ■ - ^ ^ • - GCGCC CCGAT . . . • . _ ^ - - 

Bridge, tilings are specified using -a notation which gives 
the length of. the two ..constituent segments and .- the ^relative 
position of the interrogation position . - The ..designation n/m 
indicates a segment complementary : to a region of .the. reference 
sequence, which extends for n; bases and . is Ipcated such that 
the interrogation position is in the .mth base frpm^the-^* end. 
If m is. larger:, .than n , this, indicates, that the entire segment 
is to the 5* . side of. the. interrogatiorir position, -If m is 
negative,-, it. indicates that .the interrogatipn.. position' is the 
absolute, value., ot. m bases- 5' of the first base of .the segment 
(m cannot, be zero.).-. Probes .comprising multiple ..segments , such 
as n/m + a/b + ... have a first segment at the 3' end of the 
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■ probe and^^additionai ■ seginents .added, 5-; : with respect to the 
first segment; ^ For example, a 4/ 8- tiling cons ists of (from' - 
the 3' end. of the . probe) a 4. base complementary, segme 
' starting 7 bases 5* of ; the /interrogation position, followed by 
5 a -6 base region in which;, .the interrogation position , is located 
at the third base. . Between these two segments .one, base, from 
the reference sequence "is omitted.,. By this notation, .the set 
shown above is^a 5/3 + 5/8 tiling.; Many different tilings are 

- . possible-, with this, method, since, the lengths of^ both, segments 
10 can be varied, as well as their relative position (they may be 

in . either- order and there may be a, gap between them) and their 
location relative to the interrogation -posit ion. . 

As an example, a 16 mer oligo target was hybridized to a 
chip containing all 4^° probes of length 10. The chip 
15 includes short tilings, of. both standard , and bridging types. ; 

- The- data- f rpm-a^-standard lO/.S.. tiling .^as compared- Jto data from 
a 5/3 + 5/8 bridge tiling {see Table 1) .. . Probe^intensities 
(mean count/pixel) are displayed along with discrimination 
ratios (correct probe- intensity /■ highest incorrect probe 

20 intensity,)::.., : .Missing: ;intensity values are less than ,50 .counts.. 
. Note- that -.for each base displayed the bridge tiling has ..a 
higher discrimination value, , 



2 5 



30 



3 5 



45 



TABLE 1: Comparison of Standard and Bridge Tilings 



TILING 



STANDARD 
(10/5). 



DISCRIMINATION: 



BRIDGING 
4 0 - ■ 5/3 5/8^^ 



DISCRIMINATION: 



BASE : ^ 


' CORRECT 


PROBE 


BASE 




C 


A 


c ' 


c 


A 


' 92 


496 ■ 


294 


299 


c 


536 


148 


532 


534 


G 


69 - 


-.167 - 


72 , 


52 


T 


146 " 


95 


212 


■ 126 




■ 3 J?'" 


'3.6 


1.8 " 


1.8 ' 


A 




404 




156 


C 


- ^ 276' 




345 


379 


G 




.80 






T . ■ 








5 8 




>5 . 5 


■ 5.1 


2.4 


1.26 



ThV' bridging 'strategy "offers the" following- advantages : 
' {iy -'Higher discrimination between matched and mismatched 
probes, ' — . . .. _ . _ , . 
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* (2) The possibility^ of-*-using-' longer probes - in a-bridging 

tiling," thereby increasing the specificity of ' the , / ' 
hybridization/ without sacrificing ■ discrimination / • : 

' ■ (3) The use of- probes' in which an: interrogation-.po.sition 
is located very of f -^center . .relative, .to the . regions . of target 
complementarity.' This may be of particular, advantage when, 
-for example/ when a probe centered about one region, of the 
-target gives- low hybridization signal . / The . low signal, is 
overcome by using a probe- centered about. an adjoining region 
■giving a- higher- hybr-idization signal . • - . / . , , . ■ 

. (4)'' Disruption of secondary -structure, that: might result 
in annealing- of certain probes (see previous • discussion: of 
helper mutations) . 

- 10 . ' Deletion Tiling - - " / ^ % "r ^ 

Deletion* tiling is related to' both the bridging and 
helper mutant strategies described above." . In the deletion ' 
strategy, comparisons are performed- between probes sharing a'^ 
common deletion but differing -"from each- other :at an ^ 
interrogation position located outside -the deletion . .For ^ ■ 
example/ -a first ' probe comprises first and^ second segments , 
each exactly complementary to respective first and second 
subsequences of a reference sequence, wherein the first and 
second subsequences of the reference sequence are • separated *=By 
a short distance (e.g.., 1 or 2 nucleotides). The order of the 
first and second segments in the probe is usually the same as 
that of the complement to the first and second subsequences in 
the reference sequence. , The interrogation position is. usually 
separated from . The comparison is performed with three, other 
probes, which are identical to the first probe except at an ■ 
interrogation position, which is different in each 'probe'. 
Reference:, . . AGTACCAGATCTCTAA . . . ' . 

Probe set:. . CATGGNC AGAGA (N =, interrogation' position) . 

Such tilings sometimes offer superior discrimination in 
hybridization intensities between the probe having an 
interrogation- position complementary to- the target - and other ' 
:probes"; : Thermodynamically , the difference- between ^ the 
hybridizations to matched and mismatched targets for ..Xhe probe 



• so 

set ^ shown- above^ :is the- di-f f erence ^between; a; single-base^ .bulge , 
and a large asygmietric loop (e . g . / two bases of target ; . one of 
probe). This often results in a. larger difference in 
stability than the comparison of a perfectly matched probe 
5 * with a probe showing, a. single , base - mismatch . in . the basic 

tiling strategy, . -} . ' _ 

: , The , superior discrimination of fered by- deletion tiling is 
illustrated, by Table 2, which: compares hybridization ^ data from 
a-istandard 10/5 tiling, with- a- 1(4/8 + : 6/ 3 ) deletion •.tiling of 

10 the , reference sequence. . (The numerators indicate the. length - 
of the^ segments, and the denominators, the^spacing of the.- 
deletion from the far termini of the segiuents. ) Probe \ 
: ; intensities- (mean count/pixel) - are displayed along with 
discrimination ..ratios (correct probe intensity / ■ highest 

15 incorrect ' probe intensity ) . Note that for each base ...displayed 
the deletion -tiling- has.:a higher discrimination value than 
either standard tiling shown. 



TABLE -2 . i . Comparison of Standard and : Deletion .Tilings 



20 





- : TILING ' 


PROBE 


^ BASE : 


' ^ ■■■CORRECT- 


PROBE 


BASE '~ 










c " 


^a' 


C 


C . , - 


25 






A 


92 


496 


294, 


299 




• STANDARD 




C ' 


536 


148 


532 


; 534 




(10/5) 




G 


69 


167 


72 


' 52 








T ■ 


- 146 


95 


.* 212 


.126- ■ ■ ■ : 


"30 


DISCRIMlkATiON 






. 3.7 ■ 


'3:0 


i.:8- 


^ ^lv8 ~ 








A 


■ ^ g ' . : - 


412 - 


29 


-48 ' ' . " 




DELETION 




C 


, . 297 


32 


465 


. 160 . . . 




4/8 + 6/3 




G 


'8 


77 


10 


4 ■ 


35 




T - 


8 ^ ' 


• 26 -. 


; 31 - 


5- . ■ ■ 




DISCRIMINATION: 






37.1 


5.4 


15 


3.3: . ■ ' ■ 








A^ : 




"53 3 


' 22 8 


277 * -7 


40 


STANDARD ^ . ; . 




c , 


. 729 . 


194 


536 


496 ... 




^ (10/7) ' 




G 


' 2 32 


231 


102 . 


■ 89 ■ ■ • • 






T 


. . - . 344 ' 


133^ , 


. 163 


150. . . . 




DISCRIMINATION: 






^'•..2.1 ■ • 


2.3.' 


. 2.3 


, 1, 6 ; • 


45 


















..The use. of. deletion. 


or bridging pr.obes 


is quite, general. , 




These probes 


can be 


used 


in any of 


,the 


. tiling,, strategies 



the invention. As well as offering superior discrimination, 
50 the use of deletion or bridging strategies is advantageous for 
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• certain -probes^ to avoi'd "self "hybridization: ( either - within a 
•probe or between two probes of the same sequence) 

• c' ' Preparation of- Target Samples - : 

5 - The : target ' polynucleotide, • whose sequence is . to be . '-' 

determined, is usually isolated from a tissue sample: . If the 
tariget" is genomic , the* sample may" be from any • tissue (except 
exclusively red blood cells) • For example whole blood, 
peripheral' blood lymphocytes or PBMC-, skin, - hair - or- semen are 

10 convenient sources of -clinical. samples. These' sources are 
also suitable -if the- target; is RNA.'^ Blood • and ' other body 
fluids are also a 'convenient source for isolating* viral 
nucleic ■ acids " 'If the target is mRNA," the sample is obtained 
from- a - tissue in which the mRNA is expressed ; ' If the - ■ 

is polynucleotide in -the sample is RNA'/ it ' is usually reverse 
transcribed to DNA. --DNA samples or . cDNA- resulting from ' ' 
reverse transcription are usually amplified, -e.'g., by -PGR. 
Depending on the selection of primers and amplifying 
enzyme(s), the -amplification product can be RNA or DNA; - - 

20 Paired primers -are selected to flank- the borders of a target 
polynucleotide of interest. More than one target can be 
simultaneously amplified by multiplex PGR in which multiple 
paired primers -are employed. The target can be labelled at 
one or more nucleotides during or after amplification. For 

25 some target polynucleotides (depending on size of. sample) , 
e.g., episomal DNA, sufficient DNA is present in the tissue 
sample to dispense with the amplification step. 

When the target strand is prepared in single-stranded 
form as in preparation of target RNA, the sense of ;the strand 

30 should of -course be complementary to that of the probes on the 
chip. This is .'achieved by appropriate selection of' primers. " 
The target is preferably fragmented before application to the 
chip to reduce or eliminate .the -formation of -secondary : : 
structures in the target. The average size of targets 

35 segments following hybridization is usually* larger than the 
size of ' probe on' the "chip " ~ „ 
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Ii: ILLUSTRATIVE CHIPS - - ^ ' 

■ A. HIV Chip / ^ ; ^ v 

HIV has infected a large and . expanding number. of people, 
resulting in massive health care expenditures,. HIV can ^ 

, 5 rapidly., become, resistant , to ..drugs used .to . treat the -infection', 
primarily due to the action of the heterodimeric. protein .i ( 51 
kDa .and 66 kDa) HIV reverse transcriptase (RT) both subunits 
of 'Which '.are encoded by the 1.7 kb pol gene. The high' error 
rate. (.5-10 per round), of the^ RT protein is believed to- account 

l6\ for the hypermutabiiity of HIV. The nucleoside analogues , 
i.e., A2T, ddl/ ddC, and d4T, commonly used to treat HIV 
infection -are. converted to nucleotide analogues by sequential 
phosphorylation -in the- cytoplasm of ^ infected cells,- where 
incorporation of the .analogue into ,the. viral DNA /results in 

15 termination of viral replication, because the 5 ->_.3:* : ; r 

phosphodiester- linkage cannot.be completed . 4. However > after 
about 6 months to 1 year of treatment or less, HIV typically 
mutates the RT gene so as to become incapable of incorporating 
^ 'the 'analogue and so resistant/to treatment. . Several mutations 

20 known" to be associated with drug resistance are shown^in the 
-table below, , After a- virus having- drug resistance via 'a ; 
mutation- becomes predominantv the patient suffers dramatically 
. ^increased „viral . load, .worsening symptoms (typically more.- 
frequent and dif f icult-to-treat infections) , and ultimately 

25 ' death. .> Switching. to a different treatment regimen .as soon as 
a resistant -.mutant virus takes hold may be an important step 
in patient management which prolongs patient- life and reduces 
morbidity/ during life. . . i . ^ / 
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SOME RT MUTATIONS ASSOCIATED WITH DRUG RESISTANCE 



ANTIVIRAL 


. - . CODON : 


- ' aa CHANGE • ' 


nt CHANGE 


AZT ... . . 


■ SI • 


Asp ->- Asn 


GAC -> ;AAC " 


AZT.^, : - ■ 


■ • 70 : . 


Lys -> . Arg ^ \ 


, AAA ->:;AGA 


AZT'-'-' .'.^ 


215 ^ 


Thr '->" Phe or Tyr - 


ACC -> TTC or TAC 


AZT*' -"^ 


, : 219 ^• 


Lys -> Gin or Glu-' 


AAA' -> CAA: ^or GAA 


azt'-" ■ * " 


" '41 " 


Met ->■ Leu 


ATG' ^TTG or CTG 


ddl and ' dd'c' 


184 


Met -> Vai 


ATG GTG 


ddl "and ddb 


74 


Leu -> Val 


... - ..... 


TIBO 82150 


100 - 


Leu -> lie . - 




ddC.: 


-■• - / -65' . 


Lys Asn • - 1 


AAA AG A:. : 


-ddC : . •: 


: .69.-- 


Thr, -> Asp 


ACT -> GAT > 


;3TC ^ 


184 - 


Met -> Val 


' ATG -> GTG or GTA 


'•3Te ' 


184 


Met- lie 


ATG* -> ATA * 


^'AZT * 'ddl ' 


62 


Ala ^-V Val ' 


GCC ->'GTC' 


AZT '+ ddl 


75 " * 


Val -> lie* 


GTA -> ATa" 


AZT + ddl 


77 


Phe -> Leu 


TTC ->.. TTA 


AZT > ddl 


,116 


Phe -> Tyn 


TTT^ -> TAT ■ 


AZT + ddl. 


, 151 


Gin Met. 


.CAG -> ATG ■ 


, Nevaripine i-. 


- : :103 


Lys . Asn -^"^ ^ - ■ 


AAA:— > AAT--' 




i:06 ' ; 


Val' -> Ala ' - • 


:GTA'' ->-"■ GCA 




" ' * 108 








181 


Tyr -> Cys ' 


" TAT -> TGT " 




188 


Tyr -> His 


TAT "> CAT 




190 


Gly -> Ala 


GGA -> GCA 



N.B.. Other mutations confer resistance to other drugs. 



A second important therapeutic target for anti-HIV drugs 
is the aspartyl protease enzyme encoded by the HIV genome, 
whose function is required for the formation of infectious 
progeny. See Robbins & Plattner, J. Acquired Immune 
Deficiency Syndromes 6, 162-170 (1993); Kozal et al., Curr. 
Op, Infect. Dis . 7 : 72-81 ( 1994 ).. The protease function in 
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process ing--" of viral precursor polypeptides . to-.their .-active , 
forms.- Drugs ' targeted against, this ^ en2>Tne do- not- impair 
endogenous human proteases , ' thereby achieving^- a- high, degree of 
selective toxicity. Moreover , the . protease , is expressed: later 
-in-the -^lif e-cycie^ that ^reverse transcriptase, thereby.', offering 
.the ' possibility of ' a. combined attack- on. HIV: at two different 
times in,. its life^cycle'. As; for drugs: targeted against the 
reverse : transcriptase, .administration of drugs: to/ the . protease 
!;can .result, in acquisition of drug resistance^ through::,mutation ' 
of the protease. , By . monitoring the protease gene from . / 
patients, it is possible ' to detect the .occurrence of . : 
mutations, and thereby make' appropriate ad j.ustments /in the 
drug(s) being administered, - ^ /; \ r--- v : r: v 

In addition .to being /infected with .HIV> AIDS ^patients are 
of ten . also infected 'With a wide .variety, of other infectious : 
agents -giving^ rise to- a complex series .of vsymptoms - :— Often 
diagnosis and treatment is difficult because many different 
pathogens :( some life-threatening, others routine) cause- 
similar : symptoms . . Some. of these inf ections so-called 
:oppqrtunistic .infections, . are caused by bacterial, = fungal, 
protozoan; or viral pathogens which are normally present- in 
small quantity in. the body, but are held in- check by the 
immune system, J When the immune system in AIDS patients fails, 
these normally latent pathogens can grow . and generate rampant 
.infection. In treating such patients , it; would- be desirable 
simultaneously .to -diagnose the presence ^or absence; of . a ^t. 
-variety of the most . lethal common ■ infections , determine _ the 
most effective therapeutic regime against the HIV virus, -and 
monitor- the overall status of the patient ' s infection . 

The. present , invention pr bvides:; DNA' Ichips for ; detect ihg 
the multiple -.mutations in -HIV genes : associated , with ^resistance 
.to different therapeutics . These DNA .chips allow ; physicians 
.to -monitor , mutations over time and 'to change, therapeutics if 
resistance develops.. Some chips also^-provide probes for . 
diagnosis .,of . pathogenic microorganisms - that typically, occur in 
AIDS, patients : t -r::, - v % ^ : ^ / : ■ 

:-.:The /sequence, rselected '.as a reference- sequence.^ can.:'be from 
anywhere in the .HI.V -genome , -.but should preferably • cover .a 
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- regron . of the' HIV', genome . in which/ mutations associated -with 

drug resistance are known to occur.. A reference sequence is 
. usually, between about 5, 10, . 20, 50,^100, 5000, .1000, .5, 000 or 
. 10,000 bases. in length,- and preferably is; about 100-1700 . bases 
5^ in. length. Some reference sequences, encompass at , least .part ; 
of the reverse transcriptase:* sequence encoded ; by .the ,pol -.gene . 
Preferably*; the . reference sequence.- encompasses all/ or 
substantially all: (i.e,- about 75 or 90%) of the reverse:. 
' transcriptase . gene. Reverse transcriptase is . the target of 
10 several drugs and as noted, . above, the coding sequence is the', 
site of many mutations associated with drug . resistance . * • .In ^ 
some; chips,. the. reference sequence contains vthe entire * region 
coding reverse transcriptase (850 bp) , and in other 'chips, 
subfragments thereof In some chips," the * reference sequence 
15 includes other siibf raginents of the pol - gene -encoding HIV ' > 
-protease or ' endonuclease, instead of, or as well as the - 
segrn^ht encoding- reverse transcriptase , In some chips the 
ref erence sequence ■ also includes other HIV* genes such - as ienv 
or gag as well as or instead of the- reverse transcriptase 
20 gene. ' Certain regions - of the" gag -arid env^ genes - are ' relatively 
well" conserved,- and- their detection provides a means f or 
identifying and quantifying- the amount of HIV virus infecting 
a patient. In some chips, the reference sequence comprises an 
entire" HIV- genome. > - - ^ - _ J 

25 It - is- not critical from which strain of HIV the reference 

sequence - iis .obtainedv -HIV strains are - classif ied ' as -HIV-T, 
*HIV-II-orHIV-III, and- within these generic groupings there 
are several strains and polymorphic variants of each of these. 
BRU, *SF2^ HXB2, HXB2R are examples of HIV-1 strains, -the • - 
30 sequenceis of which- are • available from GenBank. ** The' reverse 
transcriptase genes of * the'BRU and SF2 -strains differ- at -23 
nucleotides. - The- HXB2 and HXB2R strains have the • same- reverse 
^ transcriptase -gene - sequence which differs from that -of ^ the 
BRU- strain . at- four nucleotides, arid that -of ■'SF2 - by '2 7. - ' - 
3 5 nucleotides . ' In - some" chips , the* reference sequence - . 

corresponds exactly to the reverse transcriptase .-sequence in 
- the- wildtype. -^version -of ■ a strain. '-In bother - chips 'the 
reference • sequence corresponds to - a - consensus sequence ' of 
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several HIV strains. In some chips , the reference sequence 
corresponds to a; mutant form of a HIV strain. 

. Chips . are - designed., in accordance ,. with., the. tiling 

strategies noted above. . The probes are .designed, to be 
complementary to, either .the coding or noncoding strand of the 
HIV. reference sequence. If only one strand is to- be: read, it 
is preferable. tp:;read. . the coding St The greater - - ' 

percentage of A; residues in this, strand relative to the . - 
noncoding strand generally result in fewer -regions of 
ambiguous sequence. 

^ -- Some chips contain additional probes ;or, groups, of . probes 
designed to be complementary to a second reference sequence. 
The second reference sequence is often a subsequence of the 
fdrst reference -sequence, bearing one, or , more cpmmonly 
occurring HIV mutations or interstrain variations, (e.g., . 
within-: Goddns 67 > 7 0 , -215 . or^^ 2 19 - of the. reverse ^transcriptase 
gene) . The inclusion of a second group is particularly^ useful 
for analyzing short subsequences .of the primary reference 
sequence in which multiple mutations are expected, to occur 
within ^ a rShort . distance, commensurate with . the length of . the 
probes (i.e., two - or more .mutations ./within. 9 . to 21 bases) . 

- , The. total number of. :probes on the , chips depends on the 
tiling strategy, the length of .the., reference sequence,.,and the 
options selected with respect to inclusion of . multiple probe 
lengths and secondary .groups . of probes, to. provide., confirmation 
of the -^existence- of -.common mutations. To read , much, or all of 
.the .HIU^ reverse transcriptase gene (857 .b. for the.- BRU strain) , 
chips .tiled by the basic, strategy, typically contain. at. least 
857 X -4 = 34 28 probes . ..... 

' , . The target HIV .polynucleotide , whose sequence is .to. be; ; 
'determined,, is usually isolated from blood., samples . (peripheral 
blood ; lymphocytes^ or PBMC) in the . f orm of . RKA. ...The^, RNA is 
.reverse transcribed to DNA,, .and the DNA product,, is . then. . 
amplified. Depending on the., select ion., of primers and; _ 
amplifying .enzyme , . the amplification . product can. be. RNA . or 
DNA. ... Suitable ^ primers, for amplification of .target are^.. shown 
in the table ^^^below . . . 
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• * TABLE -4'- • ■ r " 

AMPLIFICATION OF TARGET 



TARGET 
SIZE 


FORWARD- PRIMER- 


REVERSE PRIMER : . : . ; 


1.742 bp 


GTAGAATTCTGTTGACTCAGATTGG 


gataagcttgggccttatcta'ttccat 


535 bp ' 


AAATCCATACAATACTCCAGTAl-i'lOC ] 


ACCCATCCAAAGGAATGGAGGTTCTTTC 


323 bp. • . 


, Genbank ^ K020i3 18S9-190S • 


pases 2211-2192 




AATTAACCCTCACTAAAGGGAga 
ggaagaatctgttgactcagattggt ^RT^1-T3) - 


AA m AATACGACTCACTATAGGGAUlcccca 
ctaacuctgtatgtcaueaca-3' C89-391 T7) .* 




AATTAACCCTCACTAAAGGGAga 
aguiacigcatLaccauicctagta rRT#3-T3 ) 






TAATACGACTCACTATAGGGAGA 
tcgacgcaggactcggccigcigaa fHVl-T2) , . 




) . > .... *. • 


AATTAACCCTCACTAAAGGGAGA 
ccttgtaagtcauggtcttaaaggU (HV2-T3) ' 





40 



In* another aspect of • the invention, chips are'provided 
for simultaneous detection of HIV and 'microorganisms that 
commonly parasitize AIDS patients (e.g., cytomegalovirus-^ 
(CKV), Pneumocystis carini (PCP) , fungi (Candida albicans) , 
mycobacteria)'. Non-HIV' viral • pathogens are detected and' their'" 
drug resistance determined using a* similar strategy' as for 
HIV.' That ■ is "groups of"probes are' designed • to 'show ■ ' 
complementarity to a 'target sequence from a region of -the 
genome" of ' a ' nonviral : pathogen known to be associated with 
acquisition of drug resistance. For example/ CMV and HSV 
viruses; which frequently co-parasitize AIDS patients / -undergo'^' 
mutations *'to* acquire" resistance to acyclovir.- " *- - 

- - 'For detection of hon-viral pathogens the 'chips dnclude 
an array of probes which^ allow full-sequence determination of 
16S ribosomal RNA or "corresponding genomic DNA of the 
pathogens. The additional probes are ' designed by' the^ same 
principles as described ' above ' except that the target sequence ' 
is "a'variabie'region from a 16S RNA (or- corresponding -DNA) of 
a pathogenic microorganism . Alternatively the - target ■ 
sequence" can 'be a consensus sequences of variable 16S rRNA 
region^ f rom^multiple- organisms. 16S ribosomal DNA 'and -RNA is 
present ' in *all organisms ( except viruses) - and the sequence of 
the' DNA" or * RNA is closely related to' the evolutionary genetic 
distance between any two species. Hence organisms -which are 
quite close in type (e.g., all mycobacteria) share a common 
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region of 16S rDNA, and- differ in other; regions ^(varaable 
regions) of the 16S rRNA, These differences can be exploited 
to . allow .identification. of the .different subtype strains. The 
f ull rs,equence_of V .16S. ribos.omal RN.A or DNA read f rom the .chip 
is. compared . against: a. database, of . the sequence of. thousands of 
known pathogens to type unambiguously most, nonviral; pathogens 
infecting. AIDS patients.:. \ . . 

- : In a- further embodiment, the„ invention provides chips 

:which talso , contain . probes for detection : of . bacterial; genes 
conferring: antibiotic 'resistance. An antibiotic resistance 
gene can- be .detected;' by hybridization to a- single probe . 
employed in a reverse, dot: blot, format. Alternatively,, a r group 
of ^prpbes can, be, designed according to the same principles 
discussed . above, to read all. or^ part the DNA sequence encoding 
an antibiotic-resistance - gene. Analogous-probes groups > are 
designed- for-reading otlier antibiotic.^^ gene. - .^ .r^ . 

. sequences. , Antibiotic resistance frequently resides in. one of 
the following genes in microorganisms -coparasitizing AIDS 
patients: -rpoB (encoding RNA polymerase) , katG. (encoding 
catalase- peroxidase , .^and/ DNA gyrase. A and - B ; genes . ; ; - . 
... , . The inclusion,, of ^ probes- fpr; combinations, .pf tests on a 
single chip simulates the clinical diagnosis tree that a 
physician would follow, based on. the presentation of a given 
syndrome which- could be caused by any . number . of possible , 
pathogenis- Such chips- allow identification of the presence 
and titer, of : HIV in a patient-^, ^ identification of -the HIV. 
strain, type .and^ drug resistance , ; identification of 

. opportunistic . pathogens , and identification- of^ the. drug . 
resistance of such pathogens . Thus, the physician is 
simultaneously apprised of the full spectrum of pathogens 
infecting the, patient ..and . the most, effective treatments^ 
therefor. - : " ... „ „ . . . . 



^ ■ (al HV 2 73 : • . • ' 

The PiV 273 chip contains an- array of - oligonucleotide 
probes for analysis of an 857 base- HIV amplicon* between , 
nucleotides 2090 and- 294 6 (HIVBRU -strain numbering).- The chip 
contains four groups of probes: '11 mers, 13 mers, IS.iners and 
.17 mers . From top to bottom, the HV. 273^ chip- is occupied by 
rows of 11 mers, followed by rows, of 13 mers , followed by rows 
of— 15- mers followed by rows of 17 mers. -The interrogation 
position is^ nucleotide 6,-7, 8 and 9 respectively- in- the- I 
different, sized-' chips . . This, arrangement- of . the different 
sized probes- is referred to: as. being "in series Within each 
size-group; there- are four probe sets laid down in ah A-lane, 

a C-lane-^ a. G-lane and a T-lane respectively . ^ - Each lane 

contains an overlapping series of probes-with one probe- for 
each nucleotide in the- 2090-2946 HIV reverse transcriptase 
reference sequence, (i.e.',- 857 probes per lane) . The lanes 
also include a few column positions which are -empty or ■ 
occupied by control probes. These positions serve - to orient 
the chipv determine background fluorescence and punctuate 
different' subsequences within the target The- chip -has an area" 
of 1-28 X 1.2 8 cm /'within which the probes form a*130 X 135 " 
matrix ( 17 , 550- cells total) . The area -occupied by each probe "' 
(i . e . , a :'probe - cell) is about 98 X 95 microns." 

The^ chip was tested for its-capacity to sequence a-- 
reverse • transcriptase- fragment- from the HIV strain SF2v- An 
831 bp RNA^ fragment (designated ^pPoll9 )' spanning most of the 
HIV ■ reverse transcriptase - coding sequence was amplified by 
PGR, - using primers ' tagged with* •T3 and T7"''prombter ^sequences. 
The primers, designated- RT#1-T3 and 89-391 T7- are -shown in 
Table -4;. see also -Gingeras et al ., J . - Inl , :Dis 164 -1066-1074 
(1991) (incorporated by reference in its entirety ;'f or all 
purposes) . RNA was labelled by incorporation of fluorescent 
nucleotides. The RNA was fragmented by heating and hybridized 
to the chip for 40 min at 30 degrees. Hybridization signals 
were quantified by fluorescence imaging. 

Taking the best data from the four probes sets at each 
position in the target sequence, 715 out of 821 bases were 
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read' correctly (-87%:) • (Compar isons- are based ';on the- -sequence 
of pPoll9 determined . by - the^ dideoxy^ ^method' to be 

identical to- SF2) . - In general, the longer sized probes r 
yielded more ■ sequence than* the shorter probes . ~ Of the. 21 
5 positibnis at which . the * SF2 and; BRU strains diverged within the 
target/ 19 were read -correctly . ^ ^ : ■ 

Many of the short ambiguous regions in the target arise 
- in segments of the target flankingthe points at which the SF2 
- -and BRU„ sequences diverge . These - ambiguities :.arrse because in 
10 these regions the comparison of hybridization signals is not"' 
drawn between -perfectly matched and single base mismatch . 
^ probes but between a single-mismatched probe and three probes 
. having two mismatches. These ambiguities in : reading an SF2 
sequence would; not detract from the chip ' s ability.; to-^ read a 
15 BRU sequence either alone or in a mixture . with an . SF-2 target 
^ : sequence.;' : ^ — " ■ " r^^^^^^ ..v. ... .... ^. 

In a variation of the above procedure, the chip was 
treated with RNase after hybridization of the pPoll9 target to 
the probes,- . Addition: of RNase digests mismatched target . and 
20 '..thereby^:increases^^the -signal to noise ratio: ^ RNase treatment ; 
^ increased -the number of correctly read bases to 743/821 . or 90% 
: (combining the data from the four groups of probes) . 

: In a further variation, the RNA target was replaced -with 
a -DNA target containing the same segment of the HIV genome, 
25 The DNA probe was -prepared by linear amplif ication using. Taq 
: polymerase, rRT#l-T3 primer , ^ and fluorescein d-UTP - label .; The 
DNA probe was fragmented with uracil DNA glycosylase/ and -heat 
treatment. ^ The hybridization pattern : across ^the array -and 
percentage of readable sequence were similar to those obtained 
3a .^raising : ah^^^^^^ there - were. ^a f ew r^egibns^ ■ ■ 

sequence that could -be ■ read from the RNA target- that could not 
be. -read .from the DNA target -and -vice versa. • : • - - ; - 

. - ^(h) ^HV 407 Chip -: : ■ ; ' : : 

3 5 The '4 07 ^chip was -designed , according to the same ^ - . : ' 

principles as .the HV 27 3 - chip, but -differs in several ■ ;~ - 
. respects.-- First,, -the oligonucleotide - probes -on this chip are 
designed; to ..exhibit perfect sequence - identity .-(with the 
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■ ^' exception of the* interrogation' 'position' on each probe) ' to the 
• -HIV strain 'SF2 ' (rather than the BRU strain as was the 'case for 
the^'HV 273 chip)'. Second, the 407 chip contains 13 mers, 15 
mers/ 17 mers and 19 mers (with interrogation positions^ at 
5 nucleotide 7, S, 9 and 10 " respectively ) , rather than' the- 11 

mers, 13 mers, 15 mers and 17mers' on the HV 273 chip. "Third, 
the • different sized groups of oligomers are arranged in 
parallel in place of *the in-series arrangement' on' the HV 273 
' chip.^ In^ the' parallel arrangement, the chip contains from top 
10 to bottom' a row" of 13 mers, a row of 15 mers, a row of 17 

mers;." a row* of 19' mers, ' followed' by a further' row' of 13 'mers, 
a* row. of 15 mers", a rowof 17 mers , a 'row of 19* mers, followed 
by a row of: 13 mers; and so forth. Each row contains ~4 ^ lanes 
of probes, an A lane, a C lane, : a g lane and a ' T lane, as 
15 described^above. -The probes in each lane tile ^ across the 

reference sequence. The layout of probes on the HV 407 chip is 
shown'^in Fig. 10. - - - - . ' ' : 

^ ' - •: The '407 chip' was separately tested for its ability to ^■ 
sequence two targets , pPoll9 RNA and 4MUT18 RNAl pP6ll9 - 
20 '-^ contains ' an 831 " bp - f raginent from the SF2 'reverse- transcriptase" 
gene which exhibits perfect complementarity to the probes on 
the 407' chip (except of course f or the interrogation * positions 
"in three of the probes in each column) . 4'MUT18 differs from 
. the-ref erence sequence at thirty-one positions -within the 
25 -target , ^ including five positions in codons 67, 70, -215'and 219- 
associated with -acquisition of drug resistance. - Target 'RNA 
- was prepared > labelled -and fragmented as described above and 
hybridized to the 'HV '407 chip . The - hybridization pattern for 
the pPdll9 target is shown • in Fig . ll.'" " ' - * 

30 'The -sequences- read of f the chip for the pPdll9 and 4MUT18 ' 

' . targets " are ' both shown in Fig. 12 (although the- two sequences 
were determined in different experiments) • The sequence ' 
labelled wildtype in the Figure is the reference sequence. 
The four lanes of sequence immediately "below the reference 
35 sequence are the respective sequences read from the four-sized 
groups'" of -probes* for- the'pPollS -target - (from top-to-bott'om, 13 
' mers', -15 mers 17 "mers - arid i9" mers)'. The next four lanes of 
sequence are -.the "sequences read' 'frcn the four-si'ized"' 'groups of 
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.:prbbes' f or -.the > 4MTJT18 target .(f rom top-to-bottom in .the same ' 
order) , . The ■ regiphs of sequences. shown in normal type are 
those that .could be read, unambiguously from the chip., Regions 
where .sequence .could not be accurately -read are shown, .. 
highlighted.. ..Some .regions .. of . sequence that could not be read 
from.., one sized set of. probes could.be read from another .- 

... .Taking, the best result fro.m the four '.sized groups . of 
probes at each column position, about 97%, of bases in the 
pPoll9 ..sequence: and- about 90% : of .-bases -in the- .4MUT18 sequence 
were read accurately.. Of the 31 nucleotide differences • 
between 4MUT18 and. the reference sequence, twenty-seven were 
read, correctly including three of the .nucleotide. changes 
^ associated with acquisition of drug resistance.. Of ,the . 
ambiguous regions in the 4MUT18 sequence determination,, most 
occurred in the 4MUT18 segments., flanking points, of diyergence: 
between the 4MUT18- and reference sequences. Notably, most of 
the common mutations in HIV reverse transcriptase associated 
with drug resistance (see Table 3) occur at. sequence positions 
that .-can be read; from the chip. Thus,. ..most of the., commonly - 
;.occurring --mutations can be . detected-, by- a, chip, containing, an 
.array.;.of probes based on a single reference sequence. . . 

Comparison of the sequence read of the probes, of- 
different sizes is useful in determining the . optimum, size 

probe to ..use for different regions of .the target The... 

strategy . of - customizing probe length within. a, single group of . - 
_probe sets minimizes the total, number of . probes, required, to 
read. .a particular target sequence. . This .leaves ample capacity 
for the chip to include probes to other reference sequences 
(e.g. , ..16S RNA. for pathogenic microorganisms) . as discussed 
''below .~; r,.'..'. ^ ■' •". ',., \ .'^r* ' . " "..'^ .". ""''' '/ '.' ' ' l'^ ^ ' .S' . " ^'^ " ■ '- ' ' 

HV. 4 07. chip^ has also, been, tested, for its . capacity to 
detect ^.mixtures of different HIV strains .... .The mixture. . 
comprises varying proportions,, of ..two target, sequences ;. one a 
segment of a reverse transcriptase gene from. a. wildtype- SF2 
strain, the other a corresponding segment from an SF2 strain 
bearing a. codon 67 mutation. See, Fig. 13. The Figure also 
represents the probes on the chip having an interrogation 
position for reading the nucleotide in which the mutation 
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bcclirs-.'- A -single- probe -in- the* Figure represents • four probes 
* on the chip with the symbol' (o) - indicating the interrogation 
position, which differs in each of the four probes. ^Figure. 14 
shows the fluorescence intensity for the four- 13.mers.and the 
5 four 15 mers having an; interrogation position for reading the: 
nucleotide in* the target seguence in -which the mutation 
occurs.- As the percentage of mutant target is increase, the 
fluorescence intensity of the probe exhibiting perfect 
complementarity to the wildtype target decreases, and the 

10 intensity- of the probe exhibiting perfect- complementarity to 
the ■mutant seguence increases. The - intensities of the other 
two ^probes do • not change ■ appreciably. '^It * is concluded -that 
the-chip can be used to' analyze simultaneously a mixture ^of 
strains;'^ and that a' strain -comprising* as little as ten-percent 

15 of a mixture can be" easily detected;* - - 

• ' - •• • c , - Protease Chip ■ 

" a' protease chip was ' constructed using the basic tiling 

strategy; The chip comprises- four probes • tiling across a 382 f 
20 • nucleotide span including- 297 • nucleotides from the protease 

coding sequence'.- The reference- sequence was a consensus Clay-v, 
B HIV protease sequence- Different probes lengths ^were 
employed for tiling different regions of the reference - 
sequence.' Probe- lengths were 11 , 14 ; 17 and 20 nucleotides 
25 with interrogation positions at or adjacent to the -center of ' 
' each probe. - Lengths' were optimized' from prior hybridization 
' data employing- a chip having- multiple 'tilings, -each- with a 
different-probe- length. . ■ 
- ' -'The chip was hybridized" to four ^different single-stranded 
30 DNA protease target sequences (HXB2, SF2 , NY5, pPol4mutl8.) . 

Both- sense and • antisense strands were sequenced. '-Data from 
the- chip was ' compared -with that from an - ABI 'sequencer The 
overall^ accuracy from ^ sequencing the-four targets - is •; - 
' illustrated in the Table 5 'below . • ■ ' • ■ ' " "■ - - • - • - 
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-ABI_ Protease Chip 



Sense' Antisense • Sense --^ -■Ant-i^^^ 



No call 0 4 



9 4 
8 



Ambiguous ' 6 * 14 ^' - 17' 

Wrong - call : 2 . 3 , • . . . 3 , . . . \ 

TOTAL 8 21* = 2 9 ' ^13:- 



tABI. (sense) - 99.5% 
Chip' (sense) - 98.1% 

ABI (antisense) - 98.6% 
Chip (antisense) - 99 . 1% 



Combining the data from sense and" antisense' strands^ both the 
^^^^ .^"^ ^^"^^^^^^^ provided, 100%' accurate' data' tor all 

of the sequence from all four clones;^-. 1 'V : ' " ' - 

*In a further test, the' chip was hybridized to 'protease 
target sequences from viral isolates obtained from four 
25 patients before and, after ddl treatment . The seqioehce read 
from the chip is shown in Fig . ^ 15 . Several' mutations " ' 
^^^^^^^^W^^ have' arisen' in the samples obtained " ' 

Particularly noteworthy was the chip's ' 
capacity to read a g/a mutation at nucleotide 267/ ^ 
notwithstanding the presence of two additional mutations (gt) 
at adjacent positions. 



B, Cvstic Fibrosis Chip s 'V ' ; : ^ " * ^. 

.Z,^./'^'^^^^ ago/ cystic fibrosis/ the ihost'c^ 

^"^^^ autosomal recessive disorder ' in humansV was' shown ' to be 
V ; -^^^^^ thereafter :hame^^^^ 

Cystic Fibrosis Transmembrane Conductance 'Regulator (CFTR) ^ 
"3^^^^ 'TYiB CTTV. gene is about 250 kb in size' and hks 27' exons. 
Wildtype genomia sequence is' available for all exoriic regions 
. ^^^"^ exons/lntron-boundaries' (Zlelenski "et' al Geno;ni(7s 10 , 
214-228 (1991).' The full-length wildtype cDNA sequence 'has 
also been described (see Riordan et al., Science 245, 1059- 
1065 (1989). Over 4 00 mutations have been mapped (see Tsui et 
al, Hu. Mutat. 1, 197-203 (1992). Many of the more common 
45, mutations -are shown in Table 6. The most .common ' cystic 
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fibrosis mutation is a three-base deletion resulting in the 
omission of amino acid .#508" from the CFTR protein. The 
frequency of mutations varies widely in populations of 
different geographic or ethnic . origin (see column 4. of. 
Table 6). About 90% of all mutations having phenotypic ' 
effects occur in coding regions. . . - 

Detection of CFTR mutations is useful in a number of 
respects. For example, screening of populations can identify 
asymptomatic heterozygous individuals. • Such individuals are 
at risk of giving rise to affected offspring suffering from CF 
if they reproduce with other such individuals:- In- utero * 
screening of fetuses is also useful in identifying fetuses 
bearing 2 CFTR mutations. Identification of such mutations 
offers the possibility, of abortion, or gene therapy. For 
couples known to be at risk of giving rise to affected 
progeny, diagnosis can be combined with in vitro reproduction 
proce.dur.es to identify an embryo having at least one wildtype • 
CF allele before implantation.. Screening children shortly 
after birth is. also of value in identifying those having 
2 copies of^the defective gene. Early detection allows 
administration of appropriate treatment (e.g., Pulmozyme 
Antibiotics, Pertussive Therapy) thereby improving the quality 
of life and perhaps prolonging the life expectancy of an 
individual • 

The source of target DNA for detecting of CFTR mutations 
is usually genomic. In adults samples can conveniently be 
obtained from blood or mouthwash epithelial cells. .In 
fetuses, samples can be obtained by several conventional 
techniques such as amniocentesis, chorionic villus sampling or 
fetal blood sampling- . At birth, blood from the amniotic , chord 
is a useful tissue source. 

The target DNA is usually amplified by PCR. Some 
appropriate pairs of primers for . amplifying segments of DNA 
including the sites of known mutations are listed in Tables 5 
and 6 - ' 
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Table 7 ' ' - ■ ^ - r --.- - ....... . ■ . 


■ 


OLIGO NUMBER 


SEQUENCE 






i V- i 1 i LjLa A 1 A 1 AL, i 1 G 1 G I G AATC AA 




/ o o 


r\v„WrtOr\ X X 1, 0 ± X \^ 1. ± X X. V^A J. A 


5 


o ^ u. 


n T P'TT T TT* A 21 a T'T' P'T a T> A <T> 
VJ X w X X 0 X w i. jl yj/\AA J. X ^ 1 ^ Alj wVj 1 A X 






^ x X 0 X wrtVjv- X wrtv— X AV— ^ X AA X 




G fl "7 


X oAUAALiAi AU i AAGC-x AGATGAA 






AAU i X X J. L- wAG i i GTAT 




OQQ 

o o o 




17 -3 ** 


J- i. AL3 1 X X U 1 ALjGGG i GGAAGATACA ■ 


^ \j 


7 J O - . . , , . ., , 


X X -ft-rt. xajav^ A^ X 0 aavjA x-v-AL. X G J. 1 CT AT 






^-^Ax 1 U^AAGAx X GAi AxTTGAA 






GGAGAx i X 1 IGCAAAGTTCATTAGA 




O ^ X 






R Q !> 


■rtw v« X X L.U.AL3U.AL. X AGAA-AG X AGAA 


15 


760 


PA A ^^T'P ft ft T'PPT'i" ft /^OT^/" AH "I'l 1 1 
V- A/\,Vj X AA X v„ X LjAIj G G X G A X X X 




850 ' 


GGT A PTPTP A A PPPTT*P A T" A TT* ft ft 
X X \j X ^AAootj X X v^A X A X GGA X A 






G ATT A P A TT A A A np A A A "PP T'P r* r"r'T"T> 






A P A T'P A A T'P ft P ft 'PT^T* ftY*^ft/^/^A A A rp/^ /^mm 
AUAX WAAXOAUAX X X AGAGGAAAJ/GCTT 




7 J X - ' ■ - . - i • 


P'PPAPPft'Pft T'T'^^'P ft ft T*/^ /^A rp/^rriA r^rnr^ y< 
X (jACUAX AX X GX AAX GGAX G.X AGTGA 


2 0 




ATP TP A A P A T* A T'T*T' r^T" ft ft ft A A 
« vJ'J orVriV-rt X A ^ X v- X ^AAvjAGG X AA 




955 


TCT PTP TPT AAft r^*Vf^ ftnp/^ nr^rn 1^7^^ > 
v^-Lv- xvjx AAA GX^ AXG GCT ,AAC a 




884 


TPPT AT AP A PTTP ft T'Trr* ft T"T»r' av* a a 
J- X /ivj/iLj X X 0 A X X wG A X X GAGAA 




8 8 5 


CCATTAAPTTA ATPTPPTPTPATPft r'ft a 
■^^c\ ± i rt-Tiv— . X X rt_rv xwXuuX U.X L.A X V— AGAA 




886 : ' 


PT A PP ATA ft TPPTPP^^/^ ft ft ft A Tt/^ A A 
^ X r\V-\-i-iX AAX X X LjLjVjAGAAAx GAA 


25 


782 . 


TCAAAGAATGGCACCAGTGTGAAA 




901 


tgcttagctaaagttaatgagttcat 
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OLIGO NUMBER . , 


SEQUENCE 


..784 . . , , 


AATTGTGAAATTGTCTGCCATTCTTAA 


785 " ' " ' " 


GATTCACTTACTGAACACAGTCTAACAA 


.791., -. 


AGGCTTCTCAGTGATCTGTTG_ 


792 


GAATCATTCAGTGGGTATAAGCA 


1013 . , 


GCCATGGTACCTATATGTCA^^ 


1012 ' 


TGCAGAGTAATATGAATTTCTTGAGTACA 


766 . 


GGG ACTCCAAATATTG.CTGTAGTAT..^ 


1065 - 


GTACCTGTTGCTCCAGGTATGTT 







10 other ' primers can be readily devised from the known 

* genomic and cDNA" sequences of CFTR. The selection of ' 
primers*/ of 'course, depends on the areas of the target 
sequence that are to be screened. The choice of primers also ^. 
depends on the "strand ' to be ' ampTif ied / " ' For some "regions "of 
15 - - . the CFTR gene, it makes little difference to .the , .hybridization* 
signal whether the coding or noncoding strand is used.* " In 
other regions, one strand may give better discrimination in 
• hybridization signals between matched and mismatched. probes 
than the other. The upper limit in the length of a segment 
2 0 that can be amplified from one pair of PGR primers is about 50 - 
kb. • Thus, for analysis of mutants through. all or much of the 
CFTR gene, it is often desirable to amplify several segments 
from several paired primers. The different segments may be 

-amplified sequent ially • or simultaneously by multiplex PGR. 

' 25 Frequently, fifteen or more' segments of the CFTR gene are 
simultaneously amplified by PGR. The primers and 
■ amplifications ' conditions are" preferably • selected to generate 
DNA targets. ' An asyitimetric labelling strategy incorporating 
f luorescently. labelled dNTPs tor random labelling and. dUTP for 
" 30 ■ "t'aYget fragmentation to "ah average length' 'of res's"*"than 60 

bases is preferred / ". The^ use\ of dUTP and^ fragmentation with 



8NSDOCI0: <WO 951 1 995A1 J_> 



78 

. . : uracil N-glycosylase ;has ,^the added ; advantage , of eliminating 
. carry, .over -between samples... ! ■ . 

Mutations in the CFTR gene can- be detected by.- any of the 
.tiling strategies noted .. above .... The block, tiling strategy is 
5 ;.one ..particularly useful , approach. In - this : strategy-, a- group. 
J Cor block) .of probes- is.used, to analyze, a: short segment of 
. contiguous nucleotides (e.g.,. 3, 5., 7 or.?) from a CFTR gene 
centered around /the site of a mutation . , The probes ;in a group 
are sometimes . referred . to. as constituting, a . block be.cause all 
10 probes ..in the group .are usually identical, except at^ their 

interrogation positions. As • noted above , the probes may also 
dif f er :in the presence . of ; leading or trailing sequences 
flanking regions of complementary. However, for ease - of 
^ illustration, it will be assumed . that .such . sequences are not 
15 ^ . present , As an example, to analyze, a .segment .of . five. . 

^ V ;;Gontigupus nucleotides from th^ site 
of a mutation (such as one of the . mutations in Table^S), a 
block of probes usually ctDntains at ^ least one wildtype probe 
-and five .sets of mutant probes , each having three . probes . The 
20 - wildtype ;pr^ ,,has ...five. interrogation . positions corresponding, 
to the ^f ive ^.nucleotides being analyzed from the reference 
sequence. However , the identity of the interrogation 
positions is only, apparent when the structure of the wildtype 
^probe .is. compared. with that of the, probes in the f ive mutant 
.probe, sets. .The first mutant, probe set., comprises. .three 
probes, each being identical : to., the. wildtype probe , except in 
the, first interrogation, position, which differs., in, each of the 
three mutant probes and the wildtype probe. . The second 
.through fifth-mutant. probe sets are similarly composed except 
that the differences from the wildtype. probe occur in- the 
.second- through .fifth interrogation position .respectively. 
Note that in- practice, each. set:of mutant probes.- is sometimes 
laid:down on.the.chip juxtaposed with an associated .wildtype 
probe. ■ In :.this situation, a, block, would /comprise . f ive 
35^ wildtype probes-, ;each effectively, providing the same 

• inf orma^tion.;. However; visual inspection : and- confidence; 
.analysis rof the chip- is facilitated by, the. largely,^ redundant 
- : -.information- provided .rby-vfiye wildtype probes. 
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- * ' After' hybridization -to labelled ' target; - the relative 
hybridization signals are read from the probes . Comparison of 
the ■ intensities of the three -probes in the first mutant probe 
set with that of the wildtype probe - indicates .the identity of 
5 the nucleotide- in the target sequence corresponding .to the 

first interrogation .position . Comparison of. the intensities 
of the- three probes in the second mutant probe set with .that 
of ' the- wildtype probe • indicates the ^^ident ity- of the nucleotide 
in- the target sequence corresponding to . the :second ■ i -^ 
10 'interrogation position and so . forth, . Collectively the 

relative hybridization intensities indicate the -identity of 
each ^of '.-the -five contiguous nucleotides in the reference^ 
sequence V • • - • 

In -a preferred embodiment, a- first group (or block) of 
15 probes- is- tiled' based on a wildtype reference sequence and a- 
second group is tiled based a mutant version of ■ the wildtype 
reference sequence. The mutation can be a point -mutation, 
'insertion or deletion or. any combination of these.- The 
combination of first and second ' groups of probes facilitates k'- 
20; analysis when multiple target • sequences * are simultaneously ; ■ 
applied ^to 'the chip, as - is the case "when a patient being 

diagnosed-is heterozygous for the CFTR allele. * 

' The above strategy- is illustrated in -Fig. 16, which shows 
two groups of probes tiled for a wildtype reference sequence'^rl 
25 and'-a point mutation thereof. • The five mutant - probe -sets f or - 
• the 'wildtype 'reference' sequence are designated wtl-5-; and the 
' five* mutant probe sets for the mutant reference -sequence are 
designated • ml-5 . The - letter N indicates the interrogation 
position, which shifts' by one position in 'successive' probe 
30 sets from the same group. The figure - illustrates' the - ^ 
' hybridization -pattern obtained when the chip is ...hybridized 
■ ' with a homozygous wildtype target sequence .comprising 
• nucleotides n-2 to n+2, where n is the site of a -mutation. 
For 'the group of probes tiled based on -the reference sequence, 
35 four' probes are compared' at- each interrogation -position.' At - 
each position, ' one of the .four probes exhibits -a :perfect match 
' ' "' With' the target , ' and the^^ other - three ^exhibit a :singl:e-base 
mismatch. For- the group * of probes tiled - based - on the -mutant 



BNSDOCID: <WO 9511995A1 J_> 



; ; . . PCT/US94/ 12305 

JW095/11995 _ ' '_ 

^.'reference -sequence, again four-, probes - are- compared at. each 
. - interrogation ..position; At position, n,- one probe exhibits a 
perfect match, and three ■ probes exhibit' a single base - . 
mismatch. .Hybridization . to a homozygous mutant yields. an 
5 analogous pattern, except that -the respective hybridization 

patterns -of probes tiled on the wildtype and mutant reference 
sequences are reversed. • 

■- • .The hybridization pattern is very different when the chip 
... is hybridized with a sample from.- a .patient who is heterozygous 
10 - for the. mutant allele (see Fig. 17) . For . the group of probes 
- tiled based on the wildtype sequence, at all .positions- but n, 
. one probe exhibits a perfect match at each interrogation 
. position^ rand the other three probes exhibit a one base 

mismatch. At position n, two probes exhibit. a perfect. match 
15 (one for each allele) , and the other probes exhibit single- 
-- - base -mismatches . . =-For the group. ^ probes tiled .on - the ..mutant 
sequence, the same result is obtained. . Thus, . the heterozygote 
point mutant is easily distinguished, from both the homozygous 

• wildtype: arid mutant forms by the identity of hybridization 
20 patternsr.from- the :two ; groups- of .probes.. ....... . 

Typically, a chip comprises -several paired groups of 

• probes, • each pair for detecting a particular mutation. For 
.example,. some chips contain 5, -10, 20, 40 or 100 paired groups 

of probes for detecting the corresponding numbers of . 

25 .mutations.. Some chips are customized to include paired groups 
. .of probes: for. detecting all. mutations common,, in particular 

populations (see Table 6)-. - Chips usually also contain, control 
• probes for verifying . that correct; amplification has occurred 
and that the target, is properly labelled. 

:3d ^ V TSie; goal , of the:- tiling - strategy described above- i.s - to- — - 
focus on short regions, of the CTFR region flanking the -sites 
of known mutation. Other tiling- strategies -analyze much 
larger regions of the CFTR gene, and are appropr;iate -for 
. locating.- and identifying hitherto .uncharacterized mutations. 

35 ■ For . example, -the entire genomic CFTR gene . (250, kb). can -be 

tiled, by the- basic- tiling strategy from an ..array- of about one 
million- probes . -Synthesis and scanning of - such, an array of 
probes is- entirely feasible... Other tiling -.strategies,, such as 
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- ' 'the- block tiling-;' multiplex tiling: or pooling . can cover the 

■entire' gene with- fewer probes. Some tiling strategies analyze 
some or all of components of the CFTR gene, such as the -cDNA 
coding sequence or individual exons . Analysis of exons 10 and 
5 il- is particularly ■ informative because these. are location of- 
• ' -'many cbimon mutations including the AF508 mutation . . • 
Exemplary CFTR chips '.\ . . : 

'One illustrative chip' bears an- array. of 1296 probes 

' cov^ririg- the f ull length of exon' 10 of "the CFTR gene ^arranged 
ro - in^a '36- X 36^ array of 356 ^m elements. ' The probes in* the 

array can* have' any length, preferably in the range of from 10 
to* -18-' residues and can - be used to detect and sequence, any 
single-base substitution and any deletion' within: the 192-base 
' exion, including" the three-base deletion known as AF508: As 
15 'described in detail below/ hybridization of nanomolar - 

concentrations of wild-type and AF508 oligonucleotide target 
^ * riucreic acids labeled with fluorescein to these arrays 
produces" highly specif ic- signals (detected with confocal 
'scannirig fluorescence microscopy) that' permit -discrimination 
20 between mutant and' wild-type target "sequences- in both-'-'': - 
"homozygous and heterozygous 'cases . • 

Sets of probes of a selected ~ length in the ' range of from' 

- id- to -18 "bases and- complementary- to' subsequences . of the . known?" 
wild-type' CFTR sequence jare synthesized starting at a position 

'25 -'^ a few*^ bases into- the' intron on the 5-' -side of exon lO and 

■'ending-' 'a few bases into the intron' on' the 3 .* .-side . : rThere is a 
'-probe'f or each possible subsequence of -the given segment of 
• ^ the- gene,' and the' probes are organized ^into a "lane.".", in. such a 
way that traversing the- lane- from the upper leftr-hand. corner 
30 ' • of the" chip to the lower righthand' corner: corresponded to 

• - traversirig' the gene segment base-by-base ' from the:- 5^^-end . The 

lane -containing that set 'of probes is, - as: noted above, .' called 
"the "wild-type lane."- : .■ : . v 

• Relative to the wild-type- lane,: a' "substitution" lane, 
35 called the "A-lane" , was "synthesized on the chip. r. .The A-lane 
■ ''probe's were ^identical in' 'sequence to" an- adjacent. .(immediat^ely 

• 'be'ldw ■ the-' corresponding)^ wild -type' :probe but- icontained*," 

' - •* -regardless- -of -the 'sequence of. the": wild.-typiB probe a : dA- 
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-residue at- posit-ion. 7-^ ( count-ing;;f rom;. the :;.3_*:-endI..,; In-similar 
fashion;-, - substitution lanes with replacement bases-.dC.,. ciG,,.and 
dT were , placed- onto the . chip :in a .-"C-lane , "_a "G-lane , *' *and a 
"T-lane, respectively.. ^ A sixth lane on the chip consisted of 
probes.\ identicarl tO' those in the wild-type lane- but for the • 
deletion of the base in position 7 and restoration of- the 
original probe length . by. addition to ;the :5.' -end. the. .base 
. complementary to the gene at that position. 

The, -f our ..substitution lanes .enable one to/ deduce othe 
sequence of a target exon- 10 nucleic acid from, the relative 
intensities . with. which- the target hybridizes to : the probes in' . 
the various lanes . - Various versions of such , exon- 1,0 DNA.'Chips 
were made as .described- above with probes 15„ -bases ^ long.,- as 
well as chips with- probes :10 • 14 and .18 :b^ long = For , the 
results described below, the^ probes were . 15, bases; :lpng and t ■ 
the- position/ of substitution- was; "7 from the 3' -end. . 

The sequences of several important probes are shown 
below. In-each case, the. letter "X" stands for the - 
interrogation position. in a given column set, so each of the 
sequences- actually.^represents four probes,/ with; A,. C.,v ,G,,„ and 
T, respectively, taking-:. -the place of the "X. . Sets^ of shorter 
probes- derived from the sets shown below, by removing up to ■ 
five^ bases f rom the. 5 V-end. of. each probe and sets of longer 
probes made from this set by adding up to . three- bases from the 
exon 10 sequence to, the 5 • -end o.f^, each, probe, ;are also useful 
and;- provided by the invention. ' - . „ 

3 • -TTTATAXTAGAAACC,... , - _ , // j 

3*- TTATAGXAGAAACCA ^ ■ ; . . . . v „ 

3 • 'TATAGTXGAAACCAC -: ^ . .. .-^ , : , / - . \ . , 

3i-.. ^.-^TAGTAGXAAGCACAA _ ^ : . • . . 

3'- AGTAGAXACCACAAA , ..^ 

3'--- ... . . GTAGAAXCCACAAAG • , • ;■ , ■ - ^ ; - ....... 

2* - ^ , TAGAAAXCACAAAGG; - . - - ^ \ ; — , , ; - ■ . ' 

3'^' r AGAAACXACAAAGGA- ■ . • - ^ ' 

. - o-^vTo/^demonstratet'the; ability • of .^the-. chip/^ tou distinguish, the 
AF508'.mutation/f rom,, the,: wild-type two .synthetic target- • 
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nucleic, acids were made . The -f irst , a 3 9-Tne'r ; complementary to 
a -subsequence of exon 10- of- the CFTR- gene having the three 
bases involved in the AF508 mutation near * its center / is 
called the "wild-type" or wt50S target, corresponds to- 
positions 111-149 of the- exon, and has the sequence shown"' 
below: • - - . . , . . - , _ - 

5 • -CATTAAAGAAAATATCATCTTTGGTGTTTCCTATGATGA. . 

The second, a 36-mer probe derived from the wi^ld-type target 
by removing those same -three baises/ is called- the "mutant" 
target* or mu508 target and- has the- sequence shown below/ ^irst ' 
with dashes to indicate the deleted bases , and- ^then without 
dashes but- with one base underlined (to indicate* the base ' ' 
detected- by -the *T-lane probe/ ^as discussed- below) ': - ■ 
5 ' -cattaaagaaaatatcat- — tggtgtttcctatgatga/' - - • ' - -* ^ • ' ' 
5 ' -cattaaagaaaatatcattggtgtttcctatgatga:- "~ - - • \ - ' * - ^ ' * - ' 
Both targets were 'labeled with fluorescein -at the 5' -end. - - 

In-three^ separate experiments/ the wild-type target, the 
mutant target,- and- an equimolar mixture of- both targets was 
exposed :{0-.l nM wt508> O.a- nM^'mu508,^ and- 0.-1- nM wtSOS- plus-'G-. 1 % 
nM mu508 , - respectively ,- in a-solution compatible- with • huc-leic 
acid hybridization) to a CF chip. - The- hybridization- mixture* 
was incubated overnight- at room temperature , - and- then * the chip 
was scanned on a reader (a confocal ' fluorescence - microscope in ^'^^ 
photon-counting- mode) >• images of the- chip* were- constructed- # 
from the photon counts)- at several'^ successively higher- 
temperatures while still in contact with- the - target- solutiohv 
After each temperature change , the chip was allowed- to- - "* 
equilibrate for approximately one-half hour • before " being - ^ 
scanned. After each set of scans, the chip ^ was ' exposed to ^ 
denaturing solvent and conditions to wash; i.e:-; -remove target 
that had bound, the chip so that 'the next- experiment ■ could^be ' 
done with a clean chip. ' - - - ~. 

The results of the experiments are" shown - in " Figures 18) - 
19, 20, and 21. Figure 18, in panels A> • B ;- - and-^ C, ^ shows ah ' ^ 
image made from the region of a DNA chip containing CFTR exon 
10 probes; in panel A, the chip was hybridized to a wild-type 
target ;: in^ panel- C-,- the-' chip- was- hybridized -tb-'a -mutant -^AF508 
target';- and- in- panel. B., the- chip' was-'hybridized^^to -a- mixture- 
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of rthe .wild-type- and mutant- targets^. . Figure ^- 1-9 , .in sheets 1 - 
_3/ ^corresponding to pa A, ^B, and ;C of Figure 3, - shows 

graphs, of fluorescence intensity versus -tiling position. The 
labels on-. the horizontal axis show- the bases - in the wild-type 
5 sequence corresponding- to. the^ position of substitution in^the 
'.respective probes. Plotted are the intensities observed from 
the features (or /syntheses sites), / containing wild-type -probes / 
the features^. containing the substitution probes that bound the 
most -target (:"called") , and the feature containing the ^ 
10 substitution probes that bound the target with- the second 
highest intensity of -all the substitution^ probes ( "2nd 

Highest^')^, v ; ^ ■ . : - 

.'These figures; show that> for the wild- type target :and ..the 
eguimolar mixture of; -targets, the substitution probe, with a ^ 
15 nucleotide - sequence- identical to the .corresponding wild-type 
-probe boundv the most target, allowing for an unambiguous 
assignment of target sequence as shown by letters, near the 
points on the curve. The target wt508- thus hybridized to the 
probes - in- the wild-type, lanet of the chip , although the 
20 strength of , the hybridization varied from, probe-to-probe, 

, probably duej to differences in- melting, temperature . - The. - 
sequence;- of - most of the target; can thus be read directly from 
the- chip, by inference from the pattern of; hybridization in • 
the lanes of- substitution probes (if the target hybridizes. 
25 most intensely-; to thev probe in the A-lane, -then one inf ers- 
that the ^ target has ^ a T- in the.- position of substitution ,^ and 
so on). . V./. -,. . : : ■ - ■ ■■ - . -.-r " 

..For r the. mutant - target , the sequence could similarly be • 
called on the 3'-side of the deletion. However, the, intensity 
2 d; : of binding . declined /precipi^t^ as the ^ point of. substitution 

approached the ^- site^. of the deletion^-f rom;, the^ 3 .» -end., of the 
target, • so that- the binding^ intensity on, the. wild-type probe- 
whose point . of - substitution-: corresponds to the T at the 3 • -end 
of the; deletion: was very/close to background.- Following .that 
35 pattern, : the wild-type, probe whose point::Of- substitution : ; -, 
corresponds-, to . the middle base, (also ^ a T) of . the; deleti.on 
bound; still ^ less^ target,;;.: However, - the; prob,e:^ in- the , T-lane- of 
that -icolumn: set bound.: the-, target- very^ well-. . -? Examination- of - 
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the- sequences -of the -two targets reveals 'that -the" -deletion - 
places an A at that position when the sequences are -aligned at 
-their- 3 • -ends and that the ^T-lane probe is complementary to^ ■ 
the mutant target with but two mismatches near: an end (shown 
below in lower-case letters ; -with the position of: substitution 
uriderLin'ed) - - - ^ — ' ■ ■ ' ^ i r 

Target:- . 5 • -CATTAAAGAAAATATCATTGGTGTTTCCTATGATGA - ^ - ^ - 

Probe: • • ' 3 • -TagTAGTAACCACAA 

Thus the T-lane probe in^ that column set' calls the 'correct - 
base from the mutant sequence. Note that, in the graph-" -for- ' • 
the equimo'lar^ mixture of ' thei two targets /- *that T-lane probe- - 
binds almost as much target as does the A-lane probe in-the- - 
same -column' set , whereas in the other column sets-, the' probes 
that 'do- not have- wild-type sequence do-not bind^ target at -a 11 
as -well. Thus, "that- one column set, and in particular the^ - 
T-lane probe within that' set, detects* the' AFSOS^ mutation under 
conditions^ that simulate the homozygous case and also 
conditions that simulate the heterozygous case . * ^ ^- * - - 

Althbugh^in- this- example the sequence could -hot' be • 
reliably deduced- near 'the- ends' of -the- target; where there is- 
not enough overlap between target and -probe to"' allow effective 
hybridization, ^ and around the center- of the target , ' where - 
hybridization was weak for some other reasbh", perhaps high - 
AT-contehtv the results show the method and the probes of the 
invention can be -used to' detect - the mutation - of interest. The 
mutant target gave a pattern' of hybridization that - was very- • 
similar to that of the wtSOB target at the ends, where the two 
share a -common -sequence,-' and very diff erent - in the middle, 
where' the deletion is located. - As one- scans" the- image f rom • 
right -to-left, the- intensity' of hybridization- of ■ the'- target -to 
the- probes in the wild-type lane drops of f - much moris 'rapidly 
near'^the center of- the image for- mu50S than - for wtSOS ; in, ^' - 
addition, there- is one probe -in- the T-lane'- that- hybridizes*^' 
intensely-with mu508 - and* -hardly- at all with- wt508; -The-- - 
results -f rom the - eguimolar mixture of- the two- targets , -*which- 
represents- the case one would encounter - in testing- a-- - 'y-- — - 
heterozygous- Individual-" for- the mutation-, "'are- a-blend-' of the- 
results f or^ the^ sep'arate^ tar get si - -showing the power - of '-the 
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invention - to distinguish -a- wild-type target ..sequence .f rom^one 
containing the AF508; inutatiqn and to detect a mixture .of /the 
two sequences," - : ; -v --- - . .. . 

' . The results above clearly demonstrate hoW;the DNA chips 
of -the. invention can be used to detect a deletion mutation ^ - 
AF5087' another" model system was- used to show that the -^chips 
can also be used to detect a point mutation as well . One 
mutation in the CFTR- gene is G480C, which involves the 
replacement of the G in position 46 of-exon 10 bya.T, 
resulting ;in the substitution of a cysteine for the glycine 
normally in position #480 of the CFTR protein. -The. model 
target sequences included the 21-mer probe - wt4 8 0 to represeint 
the wild-type sequence- at positions 37-55, of exon 10 : w - 
51-CCTTCAGAGGGTAAAATTAAG and the 21-mer.vProbe mu480 to 
represent the mutant, sequence: - - . 

5-' -GCTTCAGAGTGTA/ffiATTAAGT-^™^^^^^-^ / ~ - t. 

In separate experiments, a DNA chip was hy;;br^di zed to 
each of the targets wt480 and mu480, respectivelyr^^and then 
scanned with a cpnfocal microscope . Figure 20,- in panels A, 
B> :andr C,*- shows .an. ;image -made, from the -region . of a DNA chip - ^ 
containing CFTR- exon 10 probes;* in panel Av the chip was 
hybridized . to .the wt480 target; in panel C, the chip -was .. 
hybridized to the.mu480 target; and in panel B, the. chip was 
hybridized to a mixture . of the wild-type and . mutant targets . 
Figure .21:,: ...in sheets 1 - 3, corresponding to panels A,. B, . and 
C of- Figure 20/ shows graphs of fluorescence intensity versus 
tiling positiohi- The^ labels on the horizontal axis . show the 
bases in the wild-type sequence corresponding to the- position 
of substitution in the respective probes. . Plotted are the. 
intensities observed^ f rom ^ the features ^^( or -synthesis . sites) 
containing wild-type probes/ the features containing the - . - 
substitution probes that bound the most target .( "called" and 
the/feature containing the substitution probes that - bound the 
target :with ' the second highest intensity. of all the - - . - . 

substitution probes ( "2nd ^ Highest" ) . - - ■ „. . - - 

V.' .These-.fdgures , show that; the -chip could be used ^to 
sequence -a :16-base : stretch, from the center -of the., target wt480 
and "that . discrimination against mismatches is quite - good ' - 
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throughout the sequenced region. - When -the DNA chip was 
exposed to the target it\u480, only one probe in the portion of 
the chip shown bound the target well: the probe in- the set- of 
probes devoted to identifying the base at position "46 in exon 
5 10 and that has an A in' the position of substitution and so -is - 
fully -complementary to the central portion of the mutant v : 
target. -All other probes in that region of the chip have at 
least one mismatch with the mutant target -and -therefore bind 
much less of it. "In spite' of that fact, the sequence of- mu4 8 0 

10 for several positions -to -both sides of- the mutation can be* *^ 
read- from the chip/' albeit with much-reduced intensities from 
those -observed with the wild-type target. - - • 

The results -also show that, when the two targets were: : . 
mixed together and exposed to the chip/ the hybridizatidn * 

15 pattern observed was a combination of the' other tw;b ^patterns. 
The wild-type sequence could easily be read frdm^th'e chip'/ but 
the 'probe -'th'at^&ouhd "the mu480 target so well when only the 
mu480 target :was- ^present also bound it well when both the ■ 
mutant 'arid wild-type targets were present in- a mixture/ making : 

20 the hybridization rpatterri easily distinguishable "from- that of 
the wiid-type target alone.' These -results again show the 
power ' of the DNA chips • of the invention to detect point - ~^ ' 
mutations in both homo- and heterozygous individuals - -v 
'■'To demonstrate clinical application ■ of the DNA chips .of -vk 

25 the invention/ the chips were used to study arid detect ' -^ ' 
mutations in nucleic acids -from genomic samples.- Genomic- - 
samples'from a individual-carrying orily the wild-type- gene and 
an individual heterozygous for AF508 were amplified^ by PGR' 
using exon -10^ primers -containing the promoter ' for '■T7' ^RNA- * 

30 polymerase,* 'Illustrative -primers ■ of ■ the invention ^are' "shown 
below.'- V • ' ■ - *• ' ■ *^?~* .'^ 

Exon Name Sequence ' ' ' -^v . - : 

Id - ■ CFi9-T7- '•TAATACGACTCACTATAGGGAGatgacctaataatgatgggttt 

10 CFil0c-T7 TAATACGACTCACTATAGGGAGtagtgtgaagggttcatatgc- * 
3 5 10 CFil0c-T3 CTCGGAATTAACCCTCACTAAAGGtagtgtgaagggttcatatgc 

11 " CFil0-T7 • TAATACGACTCACTATAGGGAGagcatactaaaagtgactctc 
11 ' CFilic-T7 TAATACGAGTCACTATAGGGAGacatgaatgacattta'cagcaa' 

11 • 'CFillc-T3 CGGAATTAAGCCTCAGTAAAGGacatgaatgacattta'cagcaa • 
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These: priTners- can: be - used to ampli.fy., exon ,_.10 .or. exon ,11 
sequences; in another, embodiment, multiplex PGR .is employed, 
using tvsTo or more: pairs of primers to amplify more than . one 
exon at. a ;time.;_ - 
5 The product of amplification .was. then used as a , tempi ate 

for- the RNA^ polymerase , with f luoresceinated UTP present to 
label .the RNA ^product. .-.After sufficient RNA was. made, .-it was 
fragmented . and. applied, to an exon 10 DNA chip, for 15 minutes, 
.after-: which the chip- was washed with hybridization .buff er and 

10 scanned, with the ; fluorescence microscope, ..A useful .positive 
control included on many- CF exon 10 chips is .the 8-mer . 
3 ' -CGCCGCCG-5 ' .Figure 22, -in panels A and B, shows, an^ image 
made from, a. region of a/ DNA chip.- containing. CFTR. exon 10 
probes; .in: panel A, the, chip was hybridized, to . nucleic acid . 

15 derived from the genomic DNA of an individual with .wild-type 
AF508 -segiiences; in panel-^-B^ the target nucleic-acid 
originated from a heterozygous (with respect to the. AF5p8 
mutation) individual . Figure 23, in sheets 1,. and ,2 , 
.. corresponding to panels A and B of Figure 22„; shows graphs of 

20 fluorescence; /intensity versus tiling .position... - . . . , . 

These figures show that the sequence of the ,wild-type RNA 
can be. called- for most of the bases near .the mutation. In the 
case of the AF508 heterozygous, carrier , .one particular probe, 
the same one that distinguished so -.clearly between the 

25 wild-type-.and mutant oligonucleotide targets . in the model.,,, 
system .described above , in - 1^ binds. a large amount of 

RNA> while the. -same probe binds little RNA from the wild-type 
individual. -These results show , that the DNA .chips pf^the .. 
invention are capable of detecting the .AF508 mutation in > 

30' heteroLzygp • , V * / -l . ?: * . , T'T . f - ' 

Further chips ..were constructed using . the, block tiling , 
strategy^ to provide an array of probes for_analyzing a CFTR., 
mutation.. . The -array comprised . 93 mm.x 96 ^m features arranged 
into eleven .columns and four rows (.44 total probes). .. , Probes 

35 in. f ive of— these :,columns were from four probe sets, tiled, based 
on .the: wildtype -CFTR sequence and having - interrogation . - 
positi:onsveorresponding to -the site .of , a -.mutation and .two- 
baseSi:,o.n either .side., .p}ive -o.f the remaining,, columns contained 
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four sets of probes tiled' based on the mutant version of the, 
CFTR sequence. These- probe sets also had- -interrogation - . 
positions corresponding to the site of mutation and -two 
nucleotides on either side. The eleventh column contained 
5 four cells for control probes. : 

Fluorescently labeled hybridization targets were prepared 
by PGR amplification. 100 of genomic DNA, / 0.4 -^M of each 
primer, 50 mM each* dATP, dCTP, dCTP and- dUTP (Pharmacia) -n ' 
lOmM Tris-Cl; pH 8 . 3 / 50 itiM- KCl , ' 2 . 5 mM- MgCl2 and 2-^ Tag - - . . 

10 polymerase (Perkin-Elmer)- were cycled 36 times using a -perkin- 
Elmer 9600 thermocycler and the following times ^and ^-' — 
temperatures: gS'^Cy 10 sec . , 55^C, 10^ sec . , 30 sec. 10 

/il of • this reaction product was used- as a template, in a" * ' . 
second/ asymmetric PGR reaction. Conditions included a^M- 

15 asymmetric PGR primer,- 50 \M each dATP, ■ dGTP,- TTPy 25 • ■ - • 

f luorescein-dGTP (DuPont) y 10 mM Tris-GL, pH 9.1y-75-mM KGl,-- 
3.5 'mM MgGl2* The reaction was cycled- 5X with the following 
conditions: ' 95°G, -10 sec,* 60°G, 10 sec, 55°G, 1- min.' and 72°G, ^- 
1.5 min;* 'This* was immediately followed with another -20^ cycles t"' 

20 using the following conditions: 95°G, 10 sec, - 60«»G>-10' sec.', 
72<»G;- '1,5 min. . - - ■ " 

' Amplification products were fragmented -by treating- 

with 2 U of Ura'cir-N-glycosylase (Gibco)- at -30*'G' for 30 min. 
followed' by heat denaturation' at 95°G 'for 5 min. Finally, the i*^- 

25 labeled'," fragmented PGR" product was diluted into hybridization 
buffer made up of 5" X SSPE and ■ 1- mM Getyltrimethylammonium . 
Bromide (GTAB) . .' The dilution factor ranged** from lOx- tO' 25x -' 
with 40 " Ml of sample being diluted into. 0.4- -ml to a- ml of - 
hybridization ■ solution. * *' . . : . - 

30 Target hybridization was generally -carried out with 

the 'chip shaking 'in a small dish -containing 500 Ml to 1 ml 
total volume of ' hybridization solution. ' All 'hybridizations- ■ 
were done "at '30°G constant temperature.' Alternatively^, -some 
hybridizations were carried out with chips enclosed in a 

35 plastic package with the 1 cm x 1 cm' chip -glued f acing ,a- 250 

Ml fluid chamber; 250-350 ^i of hybridization ^solution -was- -. 

introduced and^mixed -using - a 'syringe pump;- • Temperature was 
controlled by -'interfacing the back surface- of the package- -with 
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. Peltier heating/cooling-- device... . Following hybridization ., 
chips/ were vashed with 5X^ SSPE,, 0.1%- Triton: X-IOO. at 25''C'^3 0^C 
prior- to fluorescent' image generation. ... 

•Hybridized, washed DNA chips were scanned for . 
fluorescence using a stage-scanning- conf ocal. epif luorescent 
microscope and 488nm: argon ion laser* excitation. , Emitted . 
light wasi collected through- a band pass filter centered at 
530nM. The resulting- fluorescence image .was spatially 
reconstructed and. intensity data were -then analyzed... Features 
with the peak fluorescence intensity in each column were 
identified- and- compared with any signal, intensity at the 
remaining single base mismatch probe sites in the. same column. 
The sequences. ^ of- the- highest intensity features -were -then . 
compared- across all ten. columns, of each sub-array .to. determine 
whether ' peak; intensity scores for, the wild, type" sequence, and- 
the ^ mutant sequence/were similrar^or^^^^^^^^ different". 
These results were - used to generate the genotype call of wild 
type (high intensity signals only in wild type probe, columns) , 
mutant (high- Intensity signals only, an; the . mutant probe. - 
columns) . or . heterozygous.. (high intensity- signals in. both the 
wild type- -and -mutant probe columns) . 

• Figure 24 ^(panel A) shows an image- of the fluorescence 
signals in arrays designed to detect the G551D(G>A)' and- ^ v 
Q552X(C>T) CFTR mutations The hybridization target is an 
exon .11- amplicon generated from wild type genomic DNA. .. wild 
type, hybridization.-patterns are . evident at both iocations . - No 
significant' fluorescence- signal resulted , at any. of the : . 
features; with probes complementary to mutant or- mismatched 
sequences. > Relative fluorescence intensities were six fold 
brighter for the^ perfect matched wildtype features compared 
with- the, background signal intensity at mutant and mismatch 
f eatures. -In addition, the .sequence at these loci- can -be- ^ 
confirmed, as AGGTG, and GTCAA, respectively, ^where- the bold ' 
type- face indicates the. mutation sites Figure 24 (panel B)- 
shows the same probe array features after hybridization .with a - 
fluorescent- target generated from DNA-heterozygous for the 
G551D .mutation. . Both, the wild type and^ mutant probe: columns 
have features- with ^ significant • fluorescence i'intensity-,-' ^ . 
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indicating the- hybridization of both- wild- type and mutant: CFTR 
alleles at this site.- ■ Only wildtype • probes hybridized with 
any significant fluorescence signal -in the* Q552X subarray' 
indicating a- wild type target sequence. - However, an 
5 additional feature that' did not hybridize in the first . - • .* 
experiirient shows significant fluorescence' intensity- in-this ' 
experiment'. ' Because the G551D-- and Q552X mutations are: only* ■ 
two base's apart , the a probe sequence in the* additional • ' " ' - 
feature has* a' perfectly matched 12-mer overlap ' with the- mutant 

10 G5 5 id' target. " ^ ' - . . . ■ ^-w;;:.. / 

: -Figure 25 (panels A' and B) illustrates mutation 
analysis -"for" AF508 a three "base pair deletion in'-Exon- 10 of- 
the CFTR gene. -In contrast to ' the* hybridization pattern ' seen 
in base change mutations, in mutations where-^ base's are' 

15 insertis'd o*r^ deleted probe" arrays- show a different-- 

hybridiza^ion pattern.- Identical probes are synthesized 'in 
the* two ^central columns' of ' baise substitution - arrays^ - - As a 
result/-' el'ther mutant or wild' type target: hybridizations 
always -result in two side-by-'side features (ia doublet) with 

20 high -f lubrescence .iTitehsitV' at the" center- bf" the array l''*--In-'a 
heterozygote hybridization, -two sets of doublets , one'^matched 
to the wild type- sequence and one to the- mutant sequence occur 
(Figure- 24 , panel' B) In'^ contrast,^ wild type and mutant probe 
column -sequehces" are- offset^ from each other - for- deletion or ■ 

25 i'hsertidn mutations and hybridization doublets are- not- seen.'" 
' Instead' of- the six high intensity signal's with one doublet , , ^ 
five independent' features- in- alternating* columns" characterize 
a hombzygote and ten features, one -in* each- column will" be- - ' 
positive'* with' heterozygote targets." - This- is-*" evident • from - the 

30 AFSOS"^ hybridization pattern'* in Figure ' 25 >' panel* AC *^ -Although - a 
wil'dtype target* has ' been hybridized- and the' highest - intensity 
features 'conf irm -the- wild type sequence"- (ATCTT) there " is- an 
additional^ hybridization in. the first" mutant- column.-* Analysis 
of that probe sequence shows a '10 base* perfect match with the 

3 5 mutant sequence. ' • - " ' " . . .. - _ ... v. - . ^ 

■; The 'iiriage 'in' Figure 2 5, ''pahel-' B- resulted '-from* ^ 

hybridizing^'a" DNA'^chip" with 'a ta-rget- homozygous'' f 6r-'AF508 . *• - in 
this image- five* features /- all with- probe- 'se'quenc'es'^''-'-^'- ' 
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compiementary. ~to- ..the ;mutant: -.show signif icant . signal . . .The - ■ 
mutation sequence .bridging the deletion site, ATTGG/ is ■ 
conf irmed..: .Similar; to .what, was .seen in the :example -of the 
G551D muta.tion, .there is added information in. neighboring , 
5 subarrays- designed- to detect the. AI507 and -.ESOSC mutations . ,. 
This is expected since . they are. in such close proximity - to . -. 
AF508 that their -probe ..sets significantly overlap the . AF508 
probes.. . The. AF508. homozygous target has no perfect . matches . 
with wild type or mutant, probes in the aI507. ■ and .;F508C. .. . 

10 subarrays. ..However, there are some low intensity .signals . 
within these two blocks of probes.. The F508C. array ..has a 
doublet .that matches ll bases of the mutant .AF5 08 target. . ., 
Similarly,- the hybridization in .the eighth column of the aI507 
array has; a. probe, ithat matches .13/14 bases with the /target.., 

15 Figure 26 shows hybridization. of a heterozygous double.. 

mutant-AF508/F508.G to the- same arr^ as described above. 
Conventional reverse dot blot would score this sample as a . 
homozygous AF508 mutant. In the present assays, the AF508 .and 
F.50.8C, alleles are separately detected by the respective , : ■, 

2 0 subarrays. designed to detect these; mutations .; . - - . 

S-s: Chios- for Cancer Diagnosis 

..There are. at ..least two . types of genes which, are often ., 
altered, in cancerous cells. The first type of gene is an 
25 oncogene such as a mismatch-repair gene, and -the second: type 
of gene. is. a -tumor . suppressor gene such as a transcription 
factor.,.: Examples of ..mismatch .repair, oncogenes .genes include 
hMSH2 (Fishel et al., Celi 75 ,- 1027-103,8 (1993 ) ) and .hMLHl _ 
(Papadopoulos et al . , Scierjce 263', .1625-1628 -(1994 ) ) . . The - . 
mbst..wellr.known example of a tumor .suppressor gene .is the p53 . . 
protein gene . (.Buchman. et al Gene- 7.0 ,. 245-252: (-1988 )... : By - 
monitoring the state .of both oncogenes and -.tumor, suppressor.. 
• genes- ..(individually and in combination,) in a^ .-patient,-, it is. 
possible to determine individual susceptibility to .a cancer,- a 
35 patient's: prognosis upon cancer diagnosis, .and ..to -target 
therapy more" efficiently. . ::■ . - .■ ..-^ . •.- , . . ; ... 

• The pSJ . gene -^spans...20- kbp in . humans- . and Jhas, -11 .exons/ .-,10 
of which. are:-.protein'. coding :{.see. T.oainag.a.-et:..al-. , ::1992;,.- .-. .:. 
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Critical Reviews '-in 'Oncogenesis 3 :257-282,. -incorporated .herein 
by reference) The gene produces a 53 kilodalton 
phosphoprotein that regulates DNA replication. .The protein.- 
acts -to halt replication at the Gl/S . boundary in the cell 
5 cycle and is believed to act as a . "molecular policeman/" ..^ 
shutting - down replication when .the- DNA -is damaged or blocking 
the reproduction of DNA viruses (see .:Lane , , .1992 , Nature , 
358:15^16,' incorporated herein by., reference) . ...The p53 
transcription factor is part of a fundamental pathway, which 

10 controls .cell growth. Wild-type p53 -can halt cell growth,: or 
in some ; cases, bring about programmed cell .death (apoptosis) 
Such turaor-suppressive effects are absent . in a variety of .y 
known. p53 gene mutations. : Moreover/. -p5.3 mutants not- only s\. 
deprive a ^cell -ot wild-type p53 tumor suppression,, they also, 

15 may. spur abnormal : cell growth. 

In -tumor cells p5 3 is .the most commonly, mutated, gene 
discovered . to. date (see Levineetal./.* 1991/ Nature 
'351:453-456, .and Hollstein et al., 19,9 1, • .Science 253.:49-53, 
each of - which -.is -incorporated -herein by :.ref erence-) - Over- half 

20 of the 6.5 million '••patients diagnosed with cancer annually — 
possess p53 mutations in their tumor cells. Among common 
tumors, about 70% of colorectal cancers , 50% of -lung: *cancers . 
and 40% of breast cancers contain p53 mutations. In. all, over 
51 types of. human tumors -have - been documented to possess p53 

25 mutations,' including bladder, brain, .breast, cervix, colon; ' 
esophagus, larynx, liver,, lung, ovary pancreas, prostate, 
skin, stomach, and 'thyroid tumors (Culotta & .Koshland Science 
262 1958-1961 ' (1993)-; :Rodrigues et. al . , 1990,, PNAS . ' r 
87 : 7555-7559 , incorporated herein' by reference).:, - .According . to 

30 -data -presented by^.David' Sidransky (,1992, , San. Diego*. Conference) , 
over 4 00 .mutations in p5 3 are .known. ..The presence of -a; p5 3" 
mutation -in a .tumor has -also been correlated with a .patient .'s 
prognosis. ^ .Patients who possess .p53 mutations have :a,. lower 5- 
year -^survival rate.-. ; : . .. . • 

3 5 Proper - diagnosis . of the form .of : p5 3 in : tumor .cells is * . 

critical to clinicians to prescribe .appropriate, therapeutic." 
regimens . For r.instance ,.::patients- with . breast_cancer .who show 
no invas.ion of nearby lymph nodes ; generally, do.- not.; .relapse v.: 
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af.ter .standard -surgical^ treatment and '^chemotherapy _ -Qf;' the;-, 
25% who *dai , relapse :^after su and chemotherapy y .additional 

chemotherapy is appropriate.- At present,, there is no clear,: 
way to- determine, which patients, will ■benefit . from such;- - ,.; 
5 additional chemotherapy: prior to- relapse .. \ However , / 

correlatingvp53 mutations to -tumorigenicity; and metastasis - 
provides; clinicians withi a, means to determine whether :such 
additional treatments are warranted. 

In- . addition ,to .facilitating .conventional; chemo therapy./., - 
10 appropriate diagnosis of p53 mutations provides clinicians-. 

with the- ability, to, -identify individuals, who will benefit the 
most from gene therapy techniques , in which^ appropriately. . ^ 
operative: p53, copies are restored to a tumor, site;..: Clinical; 
p53 gene therapy trials are presently underway (Culotta. .&..; 
15 Koshland; - supra) • '\/ - : . - - : ; \ ^ - - , . : 

- The:: analysis: of: p53 mutatiions^^;can also, bemused ;to : 
identify which carcinogens lead to particular tumors (Harris, 
Science 2.62, 1980-1981 (1993)). For . instance; dietary 
. aflatoxin B^. exposure, is associated /with G : C: to-T:A - 
20 transversions:. at. residue 249.; of p53 r in:: hepatocellular ■ . ■ r, 
carcinomas--{Hsu et al. . Nature 350 , 427 . -( 1991) Bressac; et\ . : 
ali, Nature- 350, .429' (1991); Harris , supra ), ^ - 

; . J While, most: described p53 mutations- are somatic, in, origin, 
some types^of . cancer are associated , with germline :p.53^ - ; 
25 mutatipn> . For .instance, Li-Fraumeni syndrome; is" a hereditary 
condition in which individuals receive mutant p53 . alleles, - 
resulting , in the . early onset of various cancers (Harris , 
supra) ;:.Frebourg et al , , PNA5 , 89 , 6413-6417^ ( 1992 ) ; Malkin et 
al. , Science 250.., 1233 (1990) ) . .These mutations are ; 
3 0 assocf atei with, instability in the rest of the genome ; • ' y 

creating ..multiple^ genetic, alterations, - and eventually, leading 
to cancer..: ;. . ■;> , - - ; , , - — . „ , - 

. . ;r: hMLHl:. and hMSH2 , are - mismatch repair genes;\which; are^ - 
causal , agents in hereditary nonpolyposis . colorectal -cancer in 
35 individuals, with , mutant hMLHl or hMSH2 alleles ..(Fishel . et al . , 
supra, and, Papadopoulos:et: al. , supra) . r Hereditary- r : - : - > rj 
nonpolyposis colorectal;- cancer: is.va; common: genetic disorders., 
af f ecting-about.^l in 2.00 individuals- (Lynch: et . al. ■:,:^.:ss r-^^. 
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GastroentBrology '-104 ^1535 (1993)0: -^ Detection . of ^ hMLHl and:- 
-hMSH2 mutations in the population allows, diagnosis of 
nonpolyposis colorectal cancer, prone, individuals prior -to the 
manifestation of disease This allows for the iiuplementation 
5 of special .screening programs for cancer-prone individuals to 
ensure .early detection- of cancer, -thereby enhancing survival • 
rates- of .afflicted, individuals . In^ addition , .genetic, 
counselors may use the information derived from KMLHl . and • 
HMSH2V:chips. to improve^.. family planning as described- for cystic 

10 fibrosis chips--. The detection of mutations:- in hKLHl'and hMSH2 
individually or in combination with- p53- can also be used by 
clinicians, to'' assess cancer prognosis and treatment modality.. 
Finally, > the- information can- be used -to target appropriate ' 
individuals' for gene therapy,. — - 

15 The entire hMLHl gene is less than 85 kbp in : length, • ' 

comprising 2268 coding nucleotides (Papadopoulos et; al* , 
supra),, Sequences from. the gene have been , deposited with 
GenBank' ^(accession - number U07418)-. - Mutations associated with 
hereditary nonpolyposis colorectal cancer ^ include: the deletion 

20 of exon '5 • (codons '578-632 ) V a 4 base pair- delet ion- of - codons- 
7 27 and 728 resulting in a shift in^ the reading frame of the' 
gene, a 4 base pair * insertion at codons 755 and- 756^resulting 
in' an extension of ^ the COOH terminus, a 371 base pair- deletion 
and f rameshif t mutation at ^ position 347 , and - a transversioh 

25 causing-an alteration vof codon 252 resulting in the .insertion 
of a- stop codon (id. ) . ^- ' - " • • • - * - . - ^- ^j. ■ . 

hMSH2 ' is a - human homblogue of the bacterial WutS -^and s - 
cerevisiae^ MSH mismatch-repair genes . - MSH2 , like - hMLHl - is - 
associated' with hereditary * nonpolyposis' cancer,- Although only 

30 a few MSH2 gene samples from tumor tissue ' have.~ been- ^ ^ 

characterized, at- least . some- tumor • samples show .a- T to - : ' 
transition mutation at position 2 020 of the cDNA sequenceV 
resulting in the - loss - of . an intron-exon splice acceptor- site. 
' "v-In-view of the-role of mutations- in- p53 ,;-MSH2 'and/or 

3 5 hMLHl = in hereditary., predisposition to cancer ,:. to neoplastic ' 
transformation events leading- to. cancer /and- to.'cancer-' 
prognosis, lt..*is^ important " to: screen - individuals tozdetermi'ne 
whether they . possess /mutant alleles , .and to identify, precisely 
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which ' mutations, - the., individuals: possess . ^ Because, many , " • 
mutations are poin,t;mutations, or extremely^, small inserjiipns 
or; deletions, which are generally. undetectable by standard. 
Southern* ana-lysis, accurate diagnosis requires; a capacity to 
5 examine a gene nucleotide-by-nucleotide , . r \ , / > - 

' - Mutations in the hMSH2 , hMLHl or p53 genes, irrespective 
of 'whether previously characterized-, can be detected by any of 
the' tiling strategies noted above. Reference sequences .of 
interest include full-length genomic and cDNA sequences of - 

10^ each of these genes and subsequences thereof, such as exons 
and introns. ^ .For example, . each nucleotide in the 20 kb p53 
genomic sequence can be tiled using the basic strategy with an 
array ::Of :about 80 , 000 : probes. ; As -in the CFTR , chip., some- 
reference, sequences are comparatively ■ short sequences 

15 including the site of - a -known mutation and a few flanking 
■ - nucieotidesr -Some Chi 

encompass mutational "hot .spots . '! For instance, a variety of 
cellular and oncoviral , proteins bind to specific regions of 
p53, including Mdm2, SV40 T antigen, Elb from adenovirus and 

20 . E6 from . human papilloma yirus*. These . binding , sites correlate 
to some extent; with observed high. frequency somatic mutation, 
regions of p53 found in tumor cells from cancer patients (see 
Harris -et al . ,. supra) . ^ Hot. spots include exons 2, 3, 5,- 6, 7 
and 8 and the intronic regions between exons 2 and 3 , 3 and 4 

25 and 4 and 5; Fragments of the hMLHl gene of particular 

interest include those encoding codons 578-632 , -727 , 728, 347, 
. 252... Some, chips are .tiled to read. mutations .in each of the 
hMSH2>-.hMIiHl and p53 genes, both wildtype and mutant versions. 
Standard or asymmetric PGR can be used to generate the 

30" target DNA used in the tiling, assays described above . in- 
general , , PGR, is. used., to, amplify. hMSH2 -hMLHl, or p53 sequences 
from a tissue of interest such as a tumor. Mixed PGR 
reactions can -also; be used , to- generate. hMSH2 ,- hMLHl or p53 . 
sequences- simultaneously in a single reaction mixture. Any of 

3 5 the. coding' or. noncoding sequences from the genes may be 

amplified for use in the block tiling assays described above. 
- \w : .-Table ' 8 - below • provides examples,. o.f. primers .which are . . 
useful in synthesizing specific regions of ; hMSH2 , hMHLHl and 
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p53 . Other priiners-' can- readily- be devised f rom-- the" known * ' 
genomic and cDNA sequences of the-genes ; The primers' 
described in Table- 8 specific for p53 amplif ication • have ends 
tailored to facilitate cloning into- standard restriction-. - 
enzyme cloning sites . : .: . ^ , . . . 

Table 8: Examples of PGR primers useful in amplifying regions of p53, hMHHI and.. 
hMSH2. " - - - . - - • 



10 



15 



20 



25 



Region 
Amplified 

Exon 5 
(p53) ^ 

Exon 5 . 
(p53) 

Exon 6 V 
Ip53) 

Exon 6 
(p53) ^ 

Exon 7 
(p53) - 

Exon 7 
(p53)^ ^ V 

Exon 8 , . 
(p53) 

Exon 8 ^ 
(p53) 

hMSH2 ' 



hMSH2' 
hMLHI \ 



Primer Sequence* ' , - 

TAA TAG GAG TCA GTA TAG' GGA GA GGC 
TGG GCA AGG AGC CGT.GTG GT ; 

ATG GAA TTA. AGC GTG AGT AAA GGG • ^ 
AGA GAG TTG TGG GCT GAG TTT GAA G 

TAA TAG GAG TGA GTA TAG GGA. GGC 
^TCC TGG GAG AGA GGC * 

ATG GAA TTA AGG GTC ACT AA GGG AGA 
TCC GCA GGC GTC TGA TTG GTC ACT G 

'tAA TAG GAC TCA GTA TAG GGA GTG 
"GGG CAG AGG GAG GCC^AGT GTG GA* 

'ATG CAA TTA*' AGC GTC ACT AAA GGG" '' 
AGA GTC TCC CCA AGG GGC ACT GGC* 
•QTC A : ; - _ ; • 

TAA TAG GAG TCA, GTA TAG GGA GGG \ 
CAT AAC TGG AGG CTT GGT CTC GTC C . 

ATG GAA TTA ACC GTC ACT AAA GGG : 
AGA GGA CCV GAT TTG CTT ACT GCC TCT 

TGC - , - . , V . ... 

'GAG ATG GGG GTG GAG CCG AAG GAG A ' 



^CTA TGT CAA TTG 'CAA ACA GTG 'CTC AGT^ 
TAC AG- ■ " ^ 

-CTT-GGC'TCT TCT GGC -GCC AAA ATG TGG * 
TTC : . ^ : : ■ ' . - - 



3 0 hMLHVi' 



TAT GTT- AAG ACA CAT CTA-TTT ATT TAT 
AAT CAA TCC • - . • - - - . ' • 



Description ' 

Exon 5 T7 Primer (5 - T7 
to p53 3'). ; 

Exon 5-.T3* Primer (5' T3 
to p53.3'). . 

Exon 6' T7. Primer {5 T7 
to p53 3'). . 

Exon 6 T3 Primer (5'T3 
to p53 3'). ' 

Exon 7 T7 Primer (5' T7^ 
to p53 3'): " ■ . - ' * 

Exon 7 T3 Primer*(5' T3 " 
to p53 3'): - ' - . 

Exon 8. T7 Primer (5' T7^v 
to p53 3'). 

Exon 8 T3 Primer (5' T3 \ 
to p53 3')'.'" '[ 

Primer^fdr^MSH2; 5' to 
3'. if used with MSH2 
primer below, a 3033 
base pair ampiicon will 
result ■ ' ~ - * ' 

"Prirner for hMSH2 5'to 

3'. ^ \ - .. 

Primer for'hMLHIvS'to 
■3;. If used with hMLHI 

primer below, a 2484 
' base' pair^ ampiicon will 
. result. v.*- r.. . .: ; - , 

•Primer for hMLHI 5' to 

.3:. , . ........ 
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■ ; - Af ter .PGR .amplif ica.tion of :,the^ target /amplicon one strand 
of the amplicpn. can be isolated , i . e . , using a biotinylated . 
primer that allows capture of the undesired strand on 
streptavidin beads . / Alternatively asyruinetric . PGR can ^^ be used 
5 to generate -a single-stranded target Another approach 

involves the generation o'f single stranded RNA ■ from the - PGR- 
product by incorporating a :T7 or other RNA polymerase -promoter 
in one of the primers. >The single-stranded .material can 
optionally be. f ragmented to generate smaller nucleic acids 

10 with less significant secondary structure than longer nucleic 
acids. ; . ' ■ :■ , ^ ^ ' 

In one- such method, . .fragmentation is combined . with 
labering.: To 'illustrate,, degenerate 8-mers or other - 
degenerate short oligonucleotides are hybridized to -the ^ 

15 single-stranded, target material. In the next step, . a-DNA 
poiymerase-' is added: with the four ^different r -r- j ^ ; 
didedxynucleotides, each labeled with a different fluorophore. 
Fluorophore-labeled dideoxynucleotide are available from a ^ 
variety of^ commercial suppliers. Hybridized 8-mers are 

20 extended ..by, av labeled , dideoxynucleotide . After an pptipnal.. 
purification, step/ .i.e ./ :with a size exclusion column, .the 
labeled 9.-mers are hybridized to the -chip. Other methods of 
target -f ragmentation can be employed. The single-stranded DNA 
can be. fragmented by partial degradation with a DNAse or 

25 partiail' depurination with acidV. .Labeling can be accomplished 
in a .separate step,: ■•i-.e . , f luorophore-labeled nucleotides are 
incorporated /before the .fragmentation, step or a DNA binding 
fluorophore, .such *as ethidium homodimer, is, attached to the 
target after fragmentation. « . 

..3 0.-^;^- : T-- ^^^-:.:V.--T . • '-^'.'"'^ ■ ; ^ . ■"■.:„:;": r ^' v: 

Exemplary Ghips ■ . , . i'. \- - ^ • 

: a.. • -Exon VI . Ghip • . 
" / ,;To illustrate the value -of , the. DNA chips of -the^ present 
invention in such ra method, , a ,DNA chip was synthesized by- ^the 
35 VLSIPS™ /method ;to . provide an array of overlapping probes .which 
represent : or tile ^across a 60 base region . of ■ expn 6 of the p53 
gene. :T.o.*'demonstrate; the . ability.- to detect substitution- 
mutations. .in the target ,:::twelve different single substitution 
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■ imitations (wild type and three ' different [substitutions .at each 
of three positions) were represented on the chip along with 
the wild typie- Each of these mutations- was represented by a 
series of twelve 12-iner oligonucleotide probes-, which were 
5 complementary to" the wild -.type target except at the one . . 

substituted base. Each of the twelve probes 'was .complementary 
to a -different -region of the target and contained the mutated 
base at a different position e • g , , if the substitution was at. 
base- 32 , "the set of -probes would be complementary- — with the . 

10 exception of base^32--to -regions of the target: 21-32 / • 22-33, / . 
and 32-43) . This enabled investigation of the effect of the 
substitution- position within the probe.: : The ^alignment of some 
of the probes with :a 12-mer model target .nucleic acid is shown 
in Figure 27 * ^ . , : v ^ 

15 To demonstrate the effect :of probe - length , an -additional 

series of ten 10-mer probes ^was included for- each mutation., 
(isee -Figure- 28) In the vicinity of -the substituted : 
positions/ the wild-type sequence was represented by every . 
possible overlapping ' 12 -mer and 10-mer probe .:. To. simplify. • 

20 comparisons,-- the probes ^corresponding :to-:each varied -position 
were •arranged on the chip in the rectangular regions with 'the 
following structure: -each row of cells .represents one ^ 
substitution, with the top* row representing the wild type. 
Each column contains probes complementary to the same, region 

25 of the target-/ with probes ^^complementary. to the 3 '-end. of the' 
•target- oh the -left and probes complementary -to the ;5V-end of 
the target on the right*. ■ The difference' between two -adjacent 
columns is 'a single base shift in the positioning .of the . 
probes. Whenever possible, the series^ of -lO-mer probes were 

3 0 placed in four rows immediately underneath and aligned with 
the 4 rows of 12r-mer probes for the same mutation. . 

To provide model targets, 5' f lubresceinated ,12-mers 
containing- all possible substitutions in:the first position of 
codon 19 2 were synthesized (.see - the' starred .position t in - the 

35' - target' in^Figure- ■ 27 ) Solutions containing: 10 nM : target .DNA 
- in "6X SSPE, 0 . 25%- Triton X-100 were hybridized: to /the : chip^ at 
room-- temperature for" several- hours. ':.While:.target . nucleic -.was 
* hybridized to- the- chip , ' the. f luorophores . on the chip.: were 
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excited- by: .light :from an- argon , laser.;- and the chip. .was :scanned 
with an autof pcusing conf oca 1- - microscope . - The .emitted -signals 
. were processed by a, PC. to. produce an image using image; 
analysis-software.. .By 1 to: 3, hours, the signal had reached a 
5 plateau; to .remove the hybridized- target . and allow 

hybridization fo another target, the chip was stripped; .with 
6.0% f brmamide,:: 2.-X SSPE: at. 17. .-C for 5 .minutes. The - washing 
buffer and temperature can vary , but , the buff er typically 
contains, ;2-tp-3X SSPE,, lO-to-60%' formamide (one. can use . - 
10 multiple washes.,- increasing .the formamide. concentration by 10% 
each, wash, - and scanning between, washes to determine when the 
wash. is complete) , and optionally a- small percentage- of Triton 
X-100,. and t.he.-temperature; is -typically in^ the' range . of- 

15-to-18 *.C;: -..^ • - ... 

: - Very distinct patterns were- observed after hybridization 
with-targets with 1 base substitutions and' visualization -with 
a confocal microscope: and software analysis, as shown in 
Figure 29. In general, the probes which form perfect matches 
with the. target , retain the highest signal. For- example ,.• in 
the;, first image > the. 12-mer: probes that- form, perfect matches, : 
.with- the, wild-type (WT)- target are in the first row (top) .- 
The 12-mer probes with single base mismatches are located in 
the second, third, and fourth rows, and have much lower 
signals. ; The data is also .depicted graphically in Figure 30. 
On each- graph, - the X ordinate is. -the- position of the probe .an ; 
its..row..on:. the chip, , and the Y ordinateMs the signal at-that 
probe - site a-f.ter; hybridization.-,; When a -target with- a 
different one base substitution is hybridized the 
complementary set of probes has the highest signal,, (see 
pictures 2 , - -3 , and-: 4 in- Figure 2? and- graphs 2 , 3 , and .4: in 
Figure 30) .In. each case,.: the- probe .set with: no, mismatches • 
with:. the target, has the .highest . signals . ; : Within a a^.-mer^ 
probe set , : .the. signal was . highest at .position -6- .or ■;7;.--. -. The' 
graphs-, show, .that the-- signal difference between 12-mer probes 
35 at the- same.. X ordinate tended to .be greatest • at .positions 5 - 
and.- 8 when the -target, and the complementary. :probes f ormed- 10 
base pairs.- and; llr base- pairs., respectively .; Because .tumors 
often have: both-: WT and. mutant p5 3-- genes.,-, mixed target 
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populations were 'also hybridized to the -chip / as shown • in 
Figure 31. When the hybridization solution consisted of a l:l 
mixture- of WT ■ 12-iner -and a 12-iner with a substitution- in • - 
position 7 of • the target , the - sets of probes that were 
5 perfectly matched to both targets showed higher signals than 
the other' probe sets. * ' : " , . . ' 

"The .'hybridization efficiency of a, 10-mer probe array as^ 
compared 'to a 12-iner .'probe- array was also compared. --The. 
10-mer ahd-"12-mer probe< arrays gave comparable signals see" 

10 graphs ^-4 in -Figure 30' and graphs 1-4 in- Figure 32) ' - ' ? 
However/' the'lO-mer probe sets, which are in rows 5-8' (see v 
images in Figure- 29) -, - seemed to be better in this model system 
than the .12-mer' probe sets at "resolving one- target from • • " 
another, consistent with the expectation that one base * 

15 mismatches'' are -more destabilizing f or'^- lO-mers than ■ 12-mers, 

Hybridization- results - within probe sets perfectly- matched' tb'-^ 
target- also:^ followed the expectation ' that , the more- matches •' 
the- individual, probei formed with the target, the higher the 
signal'.- However, duplexes with two 3 • dangles -('see' Figure 30 , 

20 position '6- in -graphs 1-4 ) • have about as much signal as-' the 

probes'-which are' matched^along their* entire length- (see- Figure 
30, - position^ .?, in graphs 1-4) . - . - . 

This" illustrative model' system shows that" 12 -mer-targetis 
that' "differ by one base substitutions- can be readily ■ 

25 distinguished' from one another by the novel- probe^ array - 

provided by the invention and that resolution ' of the different 
12-mer targets was somewhat' better ' with • the ' 10-mer probe' sets 
than with the 12-mer probe sets. . - r -i. . : 

b. ' Exon V Chip ^ - • - - 

30 " ' To -analyze DNA from- exon 5 of the p53 tumor^ suppressor 
gene; a 'set\of overlapping 17-mer probes was - synthesized on - a 
chip,-" "The probes for -the WT allele were- synthesized so' as- to 
tile across the entire exon with single base. over laps " between 
probes. For each WT- probe , a sets of ,4 ■ additional probes, 'one 

35 for each- possible: base substitution at position 7, were : 

synthesized and placed: in- a .-column relative/.to the WT/.probeV ' 
Exon '3" DNA- was- amplified byPCR with /primers .flanking: the ,v:^ 
exon. One- -of the :primers; was, .labeled : with fluorescein; the: : 
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Other primer- was •labele'd -with- biotin'. ' After ampl-ificat-ion, 
the. biotinylated strand was removed - by b-inding to streptavidin 
beads.' The fluoresceinated strand was used in hybridization. 



5 About 1/3 -of- the amplified, ■ single-stranded nucleic acid 

was hybridized overnight in 5 X SSPE at 60' C to the probe chip 
(under a" cover slip) . -After washing with 6 X SSPt, " the> chip 
was scanned using confocal microscopy. Figure 33 shows an ■ 
image of the p53 chip hybridized' to the target DNA.-^' Analysis 

10 of -the intensity data showed that 93.5% of the 184 bases of 
exon 5 were called in agreement with the WT sequence (see 
Buchman et al. , 1988; Gene 70; 245-252, incorporated-herein by 
reference) .- The miscaried bases were from positions where' 
probe signal intensities were tied (1 r6% )• and where • non-WT - 

15 probes -had ttte highest signal intensity (4 . 9%) . Figure 34 : -• 
illustrates^ how the actuar seq^^ in the ' 

sequence of letters in the WT rows correspond to control 
probes or sites. Positions at which bases are miscalled are 
represented by letters in italic type in cells corresponding 

20 to probes in which the WT bases have been substituted; by oth^r 
bases...; .- .r. ;r .. : . . .■■ . ;. - , , ,-, .. . .. . ■ .. .- .__ 

■- - As the diagram indicates, the miscalled bases are -from • 
the low intensity areas of the image, which may be due to - 
secondary, structure in the target or probes preventing 

25 .intermolecu.lar- hybridization. To diminish the effects due to 
secondary -structure,' one can employ shorter targets- (i.e:, by 
target fragmentation )> or use more stringent hybridization ■ 
conditions. ^ In. addition, the' use of a set of probes. - 
synthesized- by tiling across' the other strand of- a duplex •■ 

30 target- can- also provide- sequence information 'buried in ' " 

secondary- structure- in -the -other- strand. - It should- be - 

appreciated,- however; : that the pattern of low intensity -areas 
that .forms, as-^ a result of secondary- structure- in the target' 
itself provides a- means to identify-that a specific target -- ■ 

3 5 sequence is-, present- in; a sample . Other f actors that- may ^ • • 
contribute-: to- lower s-ignal intensities include differences'' in 
probe* diBhsit lies- and- ^hybridizatdon "stabilitiesV-' - ■ • 
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These- results deinonstirate the advantages provided -by . the 
. DNA chips of- the invention to genetic analysis. As another - 
example/ heterozygous mutations are currently sequenced by an 
arduous process involving cloning and repur if ication of DNA. 
5 The cloning step is required, because the gel sequencing 
systems . are. poor at resolving even a l:l mixture of. DNA. • 
First, -the target DNA is amplified by PGR with primers - , 
allowing.- easy ■ ligation into a vector, which is taken up by 
transformation of E. , coli . . which in turn must .be cultured;. 

10 typically on plates overnight. After growth of the bacteria, 
DNA is purified in a procedure that typically takes, about 2 
hours then, - the sequencing reactions are performed,- which - 
takes- at -least . another hour, and the samples are run ^on ..the- - 
gel for several hours, .the duration depending on the- length, of 

15 the f ragiTTient to be sequenced. By contrast, the present ■ : > - 
invention provides .direct analysis of the PGR amplified-: • 
material ..after brief , transcription and fragmentation steps, 
saving days, of time and labor. - - - 



20 .. p.. Mitochondrial Genome Ghips , ^; . 

A human cell may have several hundred mitochondria,, each 
with more than one copy of mtDNA, There is strand asymmetry 
in the base compositions , with one strand (Heavy) .being.- ir- 
relatively G rich, and. the other strand ^ (Light), being G rich. 

25 The .L -strand is .:30.9% A,. 31.2% G,. 13.1%- G,; and. 24.7%: Tv; Human 
mtDNA .is ^information-rich, encoding some 2 2, tRNAs 12S. and ,16S 
rRNAs,, and 13 polypeptides - involyed; in. oxidative;-, r ■ * r, - 
phosphorylation..: No introns have been detected. RNAs. are..- 
processed by, cleavage- at -tRNA sequences, and; polyadeny lated* - 

30 postranscriptionally . . In some^ transcripts > polyadenylation ^, 
also crea.tes -the stop codon,. illustrating the^ parsimony of / 
coding. ~ In many individuals, mtDNA can; be treated as haploid. 
However, V some -individuals are -heterpplasmic- (have; more-, than * ■ 
one mtDNA sequence) , and -the degree of heteroplasmy. can vary 

35 from tissue, to tissue. Also, the -rate of replication of- 

mtDNAs, ,can, differ, and; together- .with random segregation ..during 
cell division, can^ lead to -changes- in heteroplasmy-. pyeri- time. 
The human mitochondrial genome is 16,569 nucleotides 
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long. The :seguence .of the L-strand is- -numbered arbitrarily. ,, 
from the MboI-5/7 jDoundary in. the D-lopp region . . .The. complete 
sequence^ of the human mitochondrial .genome .has been published. 
See Anderson et al. , -Nature 290, ^-457-465 .(.1981) ... . 
5 Mitochondrial DNA.is maternally inherited^,, and has a mutation 
rate ■ estimated to be-- tenf old higher than single copy, nuclear 
DNA (Brown et al . Proc:- * ATa tl . Acad . Sci . USA^ ie , 1967-1971 
(1979) ) . Human mtDNAs differ, on average, by about;. 70 .base 

substitutions: (Wallace,, Ann.- Rev,- Bxochem. 61 , v. 1175-12 12 - 

10 (1992) ) . -Over 80%^ of substitutions' are transitions .:(i.e, , 
pyrimidine-pyrimidine or purine-purine). - 

Analysis of mitochondrial DNA, serves several purposes. - 
Detection of ^mutations- inr the mitochondrial genome allows 
diagnosis . of ;a number , of diseases. The mitochondriaa .genome 
15 has 'been^ identified- as the locus of several- mutations- ■ . - 

associated-; with human : Some of ^ the;: mutations result 

in stop codons in structural genes. Such mutations have been 

mapped and associated with diseases , such as Leber • s- - 

hereditary optic. .neuropathy, neurogenic muscular weakness , 
20 ataxia and retinitis .pigmentosa - Other mutations > (nucleotide 
substitutions) -occur in. tRNA. coding sequences,, and presumably 
cause conformational defects- in. transcribed tRNA molecules. . 
Such mutations have also been mapped and associated, with* 
diseases, such , as Myoclonic Epilepsy and Ragged Red Fiber. 
25 Disease. , Ariother. type of. mutation- commonly found is . deletions 
and/or .- insertions . Some deletions span segments- of several- r 
kb. - Again, such- mutations have been mapped^ and- associated- 
with diseases, for example, ocular myopathy and Person ^ - 
Syndrome. -See Wallace, Ann. i?ev. Biochejn. .61-1175-1212 ( 1992 ) 
30 . ^incorporated by reference in its entirety for alL purposes) . 
Early detection. of . such- diseases allows metabolic- or genetic 
therapy to be administered before irretrievable damage : has - 
occurred . ; .-Jd. . - -Analysis of -mitochondrial- DNA is also 
important for. forensic screening. - Because - the mitochondrial 
35 genome ^is .a locus/of - high • variability between- individuals , - - 
sequencing a substantial^ length of. mitochondrial DNA; provides 
a fingerprint that - is- highly-,- specif ic.tov an • individual;.:-.. j 
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Analysis of mitochondrial DNA -is also xmportant ' f or -* 
evolutionary and epidemiological ' studies . 

' The reference sequence ^can- be an entire mitochondrial 
genome or any fragment thereof. For forensic and 
5 epidemiological '^'studies / the reference sequence is often all 
or part of . the D-loop region in which variability between • * 
individuals is greatest {e.g., from 16024-16401 and -29-408). 
For detection of mutations/ analysis of the entire genome is 
useful as* a reference sequence, but shorter segments- including 

10 the sites of known' mutations , and about l-20.*flanking bases 
are also useful. Some chips have probes tiling ■ paired - ■ 
reference "sequences,' representing wildtype and mutant versions 
of a' 'Sequence, Tiling a second reference sequence- is - - • ' 
particularly useful' for detecting an insertion mutation' 

15 occurring- -iri 30-50% of ocular myopathy and Pearson syndrome- 
patients^ which consists of direct repeats of the. sequence-^ ' 
ACCTCCCTGACCA.- -Some chips include reference -sequences from 
more than one mitochondrial genome.- - • — /- 

Mitochondrial- reference' sequences . can -be tiled using any 

20 of- 'the- strategies, noted- above.- • The block tiling* strategy -is^^ 
particularly useful - for ' analyzing short reference^' sequences or 
known mutations. Either the- block strategy " or • the basic 
strategy is suitable for analyzing long reference sequences. 
In many of the tiling strategies, it is possible to use fewer 

25 probes compared with the number used ' in' other chips without - 
signxficant loss olf sequence information.- As "noted above , - ' 
most point- mutations in mitochondrial • DNA^; aire 'transitions , so 
for each- wildtype nucleotide in a reference sequence, -one 'of 
the three possible nucleotide substitutions is much more 

30 likely . than the" other two / - Accordingly > in the basic - tiling ' 
strategy/- for examiple, a reference sequence-' can-'be -tiled'-usihg 
only two' probe- sets. -One probe sets comprises a • plurality -of 
probes, each probe having a '• segment ' exactly^- complementary to* 
this'ref eriehce ' sequence." - The second probe - set - comprises "a 

35 corresponding probe- for each probe in the first set . ". However , 
a .probe" from the second' probe set dif f ers^f'rbm the ' 
corresponding- probe from - the f irst • probe " setrin an * r.': : 

interrogation position, in which the probe from the second 
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probe set' includes- :the' transition of th'e.'.nuc'leotide: present'.; in 
that position 'ih- the probe^'f rom the first probe set,. 

Target mitocKbndrial DNA can be ampirfied,. labelled and 
-fragmented prior to hybridization using the .same procedures as 
described for other chips.. Use of at least two labelled 
nucleotides: is' desirable to achieve unif orin/ laberiihg . Some 
exemplary ''primers are. described below and - other primers: can be 
designed from the known' sequence of mitochondrial DNA. 
Because mitochondrial: DNA is present in multiple copies, per; 
cell; it* can' also be hybridized directly to a; chip. without 
prior amplification. ^ 



Exemplarvr chips .v. ; \ : . - 

The invention provides: a DNA chip for /analyzing, sequences 

15 contained: in - a: 1. 3 kb. fragment of human mitochondrial: 'dna from 
' ^ 'the -"Dr-lbop" - regio^^ human ' 

mitochondrial DNA. One. such chip comprises a set of 269 
overlapping oligonucleotide probes of varying' length ;in the 
range of 9-14 nucleotides with ^varying overlaps. ' arranged in 

20 "600 X 6ao micron features or synthesis , sites' in an array l cm 
X 1 cm in :size.: The- probes on the chip are shown in columnar 
form below. An illustrative mitochondrial DNA chip of the 
invention comprises the following probes (X, Y coordinates are 
shown, followed by the sequence-; "DL3" represents the 3 '-end 

25 of the probeV: which^.is covalently attached to the chip ^ 
surf aceV) : ' ;/ / r: ; ; ' 

DL3GGTTGGTTTGGG 
DL3 TGGGGTTTCTAG 
DL3 GTTTCTAGTGGG 
DL3AGTGGGGGGTGT 
DL3GGGGTGTCAAAT ' . 1 . . V 
DL3 GTCAAATACATCG 
DL3ACATCGAATGGAG ■ 
DL3 CGAATGGAGGAG 
DL3GAGGAGTTTCGT • " 

DL3 TTTCGTTATGTGA ' 
DL3ATGTGACTTTTAC * 
■ DL3GACTTTTACAAAT " 
DL3AAATCTGCCCGA « 
DL3AATCTGCCCGAG 
DL3CCCGAGTGTAGT ^ 
^DI.3AGTGTAGTGGGG c 
DL3 GGGAGGGTGAG 
DL3GGTGAGGGTATG " 





0 


0 


^ DL3 AGTGGGGTATTT. 


1 


1 




1 • 


. 0 


DL3 GGGTATTTAGTT 


2 


1 




2 


0 


DL3 TTAGTTTATCCAA 


3 


1 


30 


3 


0 


DL3ATCCAAACCAGG 


4 


1 




- 4- 


O 


DL3ACCAGGATeGGA ^ 




*1 




5 


0 


DL3CGTGTGTGTGTGG 


6 


.1 




6 


0 


'DL3 CGTGTGTGTGTGGC 


7 


1 




7 


0 


DL3TCGTGTGTGTGTGG 


8 


1 


35 


8 


0 


DL3GTAGGATGGGTC' 


9 


1 




9 


0 


DL3AG'GATGGGTCGT . 


10 


,1 




10 


0 


DL3GATGGGTCGTGT ^ 


.^11 


1^ 




11 


0 


DL3 TGGCGACGATTG 


12 


1 




12 


0- 


-DL'3 GCGACGATTGGG* 


13 


1 


40 


13 


0 


DL3TGGGGGGGA ' . . 


14 


1 




14 


0 


DL3GAGGGGGCG ^ \ 


15 . 


1 




15 


0 




16 


1 




16 


0 


DL3GAGGGGGCGA • 


0 


2 




0 


1 


DL3GGCTTGGTTGG 


■1 


2: 
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UXj J X X X tj X AijAvjiAL.Lj - 
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X O 




X X 


*i 


PiT T Ta a a r'/^^TTT 

UXj^ XAwAOAU.uuX XX. 


n 
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1 

X ^ 




nT A r^GPTTTGGGG 
LJXjJAwo^jX X XuOL^O 


T 
X 








nT '^TGGGGTTTTTGT 

L^Xj^XWWVjwXXXXXwX 








*l 


nT,7 GGGTTTTTGTTT 

X^XjJVjwWXXXXXwXXX 


-x 

•J 




1 S 

X ^ 


/ 
*♦ 


ni TTTGTTTPTTGGG 

XJX-iJXXuXXX^XXVjV^v? 








A 


ni ^Tr'TTGGG ATTGTG 

Jm/Xj^ X^X XwwU r\ X X w X O , • 




sn 
^ \j 


n 


c; 


nT TTGT ATGA ATG ATTT 
JL/ Xj «j X O X A X wrvr^ X X X X 






X 


c 


nt '^TG ATTTr* A GAG A A 
UXj.J XVjiAX X X CA^^rWvivi 


7 




2 


5 


DL3ACACAATTAATTAA 


8 




3 


5 


DL3AATTAATTACGAA . 


9 




4 


5 


Dr:3TACGAACATCCTG 


10 


55 


5 


5 


DL3ACGAACATCCTGT ' - : 


11 




6 


5 


DL3.TCCTGTATTATTA 


12 




.7 ' 


5 


DL3GTATTATTATTGTT 


13 



5 DL3ATTGTTAAACTTA • 

5. •DL3AAACTTACAGACG 
5 DL3ACAGACGTGTCG 

5 .DL3GTGTCGGTGAAA ' 
5 DL3GTGAAAGGTGTGT 
5 DL3GGTGTGTCTGTAG 
5-. DL3TGTGTCTGTAGTA • - 
5 DL3GTAGTATTGTTTT 

5 DL3AGTATTGTTTTTT 

6. DL3CCTCGTGGGATA 

6. DL3TGGGATACAGCG 

6 DL3 GATACAGCGTCAT 
6.. DL3GCGTCATAGACAG'' 
6 DL3AGACAGAAACTAA 
6 DL3CAGAAACTAAGGA 

6 DL3TAAGGACGGAGT: . 
6 DL3GACGGAGTAGGA 
6 DL3GTAGGATAATAAA 
6 DL3TAATAAATAGCG ; - - - 
6 . DL3ATAGCGTAGGAT 
'6 ■ .DL3TAGCGTAGGATG 
6 . DL3AGGATGCAAGTT; : 
6 ,DL3 ATGCAAGTTATAA 
6 ' DL3.GTTATAATGTCCG' 
6 DL3ATGTCCGCTTGT ^ * 

6 DL3TCGGCTTGTATG 

7 DL3GTGAGTGCCCTC 

7 DL3TGCCCTCGAGAG/ - : - 
,7^ DL3CCTCGAGAGGTA , ... 
7^' DL3AGAGGTACGTAA ' ' 
7 : DL3ACGTAAACCATA - 

7. DL3AC.CATAAAAGCAG . - 

7 DL3AAAGCAGACCC ^ • 

7 DL3AGACCCCCCAT . 
7 DL3CCCCCATACGT 
7 DL3CATACGTGCGCT ^ 
7 :DL3GTGCGCTATCAG V. . 

7 DL3GCGCTATCAGTA 

7 DL3TCAGTAACGCTC 
7 DL3GTAACGCTGTGC- 
7 DL3CTCTGCGACCTC - 
7 DL3GACCTCGGCCT * 

7 DL3TCGGCCTCGTG - 

8 DL3GATGAAGTCCCAG . 

8 DL3AGTCCCAGTATTT - ' 
8 DL3GTATTTCGGATTT' 

8. DL3TCGGATTTATCG- 
8 DL3GATTTATCGGGT 

8 DL3ATCGGGTGTGCA: / 

8 DL3TGTGCAAGGGGA 

S DL3CAAGGGGAATTT: 

8 DL3GAATTTATTCTGTA • 

8 DL3TCTGTAGTGCTAC . 

8 DL3GTAGTGCTACCT 

8 DL-3GCTAeCTAGTAG- ; " : 

8 DL3CTAGTAGTCCAGA 

8 DL3TCCAGATA'gTGGG 
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14 8 DL3AGATAGTGGGATA 

15 8 DL3GGGATAATTGGT , 

16 8 DL3TAATTGGTGAGTG 
0 9 DL3TATAGGGCGTGT 

5 19 DL3GGCGTGTtCTCA 

2 9. DL3GTGTTCTCACGAT 

3 9' DL3TCACGATGAGAGG 
. ' 4 . 9 , DL3ATGAGAGGAGCG 

5. 9 DL3AGGAGCGAGGC 
10 6 - 9 DL3CGAGGCCCGG 

7 9, DL3GCCCGGGTATT . 

8 9 DL3CGGGTATTGTGA 

9 9 DL3GTGAACCCCCAT ■ 
10- 9 DL3GCCCATCGATTT 

15 11 9 DL3ATCGATTTCACTT 

12 9 DL3TTTCACTTGACAT ' 

13 9 DL3TTGACATAGAGGT. 

14 9 DL3TAGAGCTGTAGAC 
.15 9 DL3GTAGACCAAGGA 

2 0 16 '9 DL3ACCAAGGATGAAG 

0 10 DL3CGTGTAATGTCAG 

1 10 DL3TGTCAGTTTAGGG 
. 2 1 0 DL3 TC AGTTTAGGG A 

3 10 DL3TAGGGAAGnAGCA 
25 4 10 .DL3AAGAGCAGGGGT 
.5 10 DL3CAGGGGTACCTA 

6 10 DL3GGTACCTACTGG 

7 10 DL3TACTGGGGGGA 

8 10 DL3GGGGGAGTCTAT 
30 9^ 10 DL3AGTCTATCCCCA ' 

• -10' 10 - DL3ATCCCCAGGGA • 

11 10 DL3CAGGGAACTGGT 

12 10 . DL3ACTGGTGGTAGG 

13 10 DL3CTGGTGGTAGGA 
35 14 10. DL3GTAGGAGGCACA 

15 10 DL3GGCACATTTAGT 
16* 10 DL3TTTAGTTATAGGG - 

0 11 DL3AGGTTTACGGTG 

1 11 DL3TACGGTGGGGA ' 
40 2- 11 DL3GTGGGGAGTGG - 

3 11 PL3GGGAGTGGGTGA 

4 11 'DL3GGGTGATCCTATG 
5. 11 DL3CCTATGGTTGTTT 

.6;. , 1 l,:.DLaGi:^riKTTT^ - 
45 , 7: 11 DL3GTTTGGATGGGT 

8 11 %DL3ATGGGTGGGAAT - 

9 il DL3GGGAATTGTCATG 

10 11 DL3GTCATGTATCATGT 

11 11 •^DL3TCATGTATTTCGG 
50 12 11 ..DL3TATTTGGGTAAA 

13 11 'DL3TTCGGTAAATGG 

14 11 DL3GTAAATGGCATGT 
15, 11 DL3GCATGTAATCGTG 

16 11 DL3GTAATCGTGTAAT ' 
55 5 ~~12 - DL3GGGAGGGGTAC - 

6 12 PL3GGGTACGAATGT 

7 12 *DL3ACGAATGTTCGTT 



8 1-2 

9 12 

10 12 

11 12 

12 12 

13 12 

14 12 

15 ,12 

16 12 
■5 ' 13 



6 
7 
8 
9 



,13 
13 
13. 
13 



10 13 

11 13 
12, 13 

./13 13 

14 13 

15 13 
. 16 13 

5 14 



,6 
7 
8 
9 



14 
14 
14 
14 



10 14 

11 14 

12 14 
^13 14 

14 14 

15 14 

16 14 
5 15 



6 
7 
8 
9 



15 
15 
15 
15 



10 15 

11 15 

12 15 

13 15 

14 .'l5 

15 '15 ■ 

16 15 
5 ' 16 



6 
7 
8 
9 



16 
16 
16 
16 



10 16 

11 16 

12 16 

13 16 
14/ 16 
15. 16 
16 16 



DL3TGTTCGTTCATGT 
DL3 CGTTCATGTCGTT 
DL3GTGGTTAGTTGG . 
DL3TAGTTGGGAGTT 
, PL3 GGAGTTGATAGTG 
DL3ATAGTGTGTAGTT 
DL3 GTGTAGTTGACGT 
DL3 TGACGTTGAGGT . 
DL3CGTTGAGGTTTA ^ 
DL3 TATAACATGCCAT 
DL 3.AACATG CC ATGGT 
DL3 CCATGGTATTTAT 
DL3ATTTATGAACTGG ' 
. DL3 AACTGGTGGACAT 
DL3 TGGACATCATGTA 
DL3 CATGTATTTTTGG 
DL3 TTTTGGGTTAGG 
DL3 GGGTTAGGATGT 
DL3 GGATGTAGTTTTG " 
DL3TGTAGTTTTGGG ... 
DL3 TTTGGGGGAGG 
DL3GGGTTCATAACTG 
. PL3ATAACTGAG.TGGG 
DL3AACTGAGTGGGT 
DL3GTGGGTAGTTGT 
PL3GTAGTTGTTGGC 
DL3GTTGGCGATACA ^ 
DL3 OGATA CATAAAAG 
D L 3 T AAAAG C ATG T AA 
DL3GCATGTAATGACG 
DL3ATGACGGTCGGT 
DL3 GTCGGTGGTACT 
DL3 GGTACTTATAACA 
DL3 TCGATTCTAAGAT 
D L 3 TAAG ATT AAATTT- 
DL3AAATTTGAATAAG 
DL3AATAAGAGACAAG • 
DL3 AAG AG ACAAG AAA 
DL3AAGAAAGTACCC 
DL3AAAGTACCCCTT ■ 
PL3 CCCCTTCGTCTA 
DL3 CTTCGTCTAAAC 
•DL3 CTAAACCCATGG - 
pL3^Agg^^Eg5|EG@ : 
DL3TGGTGGGTTCAT 
DL3 TTGGAAAAAGGT ^ 
DL3AAAAGGTTCCTG 
DL3GGTTCCTGTTTA 
DL3 CCTGTTTAGTCTC 
DL3TTAGTCTCTTTTT 
DL3CTTTTTCAGAAAt' 
DL3 AGAAATTGAGGTG ' 
DL3AAATTGAGGTGGT 
DL3GGTGGTAATCGT 
DL3TAATCGTGGGTT ' - 
PL3GTGGGTTTCGAT 
DL3GGTTTCGATTCT * " 
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No probes were present in positions X, Y = 0, 12 to X, Y = 4, 

12; X, Y =. 0, 13 to X, Y = .4, 13; X, Y = 0, 14 to X, Y = 4, 

14 ; X, Y = 0, 15 to X, Y = 4 , 15 ; X, Y = 0, 16 to X, Y = 4 , 

5 16; .; , . • : ' . ... - \ " . • 

The length of each of the probes on the chip was variable to 
minimize differences in melting temperature and potential for 
cross-hybridization. * Each position in the sequence was 
represented by at least one probe and most positions were 

10 represented by - 2 or more probes. As noted above, the amount 
of overlap between the oligonucleotides varied from probe to 
probe. Figure 35 shows the human mitochondrial genome; "Oj^"- 
is the H strand origin of replication, and arrows indicate the 
cloned unshaded sequence. 

15 DNA was. prepared from hair roots of six human. donors (mtl 

to mt6) and then amplified by PGR and cloned into M13; the 
resulting clones were sequenced using chain terminators to 
verify that the desired specific sequences were present. DNA 
. from the sequenced M13 clones was amplified -.by. PGR,. _ . ; 7 

20 transcribed in vitro ,^ and labeled with f luorescein-UTP using-? 
T3 RNA polymerase. The 1.3 kb RNA transcripts, were fragmented 
and hybridized to the chip. The results showed that- each f 
different .individual had DNA -that produced a. unique • , , 
hybridization fingerprint on the chip and that the ^differences 

25 in the ^obseryed patterns' could be correlated with differences 
in the cloned genomic DNA sequence. . The results also 
demonstrated^ Y.^^V lorig sequences of a ^target nucleic acid 

can be represented comprehensively as a specific set of ■ 
overlapping; oligpnucleotides and that arrays of such , probe 

30 sets can be usefully applied to genetic analysis. • - , ^ 
The sample -nucleic /acid was hybridized .-to the chip in a 
solution composed of 6 X SSPE, 0.1% Tr iton-X 100 • f or ^ 60 
minutes- at 15^-C.- The chip was then scanned by confocal : 
scanning- fluorescence- microscopy. -The individual ; features on 

35 the chip were 588 . X 588 microns , but the, lower . left 5 x ^5^ ... 
square -features in the array did not contain probes.- To ■ 
quantitate ..the data , ; pixel^ counts "were measured.. within, each - 
synthesis' site-.- Pixels represent 50 x 50 microns. The • 
fluorescence intensity for each feature was scaled to a mean 
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. determined' 7from./2 7^^bri-ght -features'.- -.After 'scanning;- -the -chip 
was- stripped .and. rehybridized; all six^ samples were hybridized 
.to the same chip. / Figure 3 6 shows- the image- observed from the 
. mt4 sample: on- the ;DNA chip.. Figure 37 shows the image ■ 
5 observed from, the mtS sample on the" DNA chip, ■ Figure 3 8 shows 
the predicted-difference image between the mt4 and -mt5 samples 
on the DNA- chip .based: on mismatches between- the two samples 
- and the reference sequence (see Anderson et al.;^ supra) , - 
Figure '3 9 shows' the actual difference image observed, ; . - 
0 ; 7 The- results /show that , in almost all cases , mismatched 
probe/target hybrids resulted in lower fluorescence intensity 
than perfectly matched hybrids. Nonetheless/ some probes 
detected mutations (or specif ic .sequences) better- than- others , 
and in several /cases; ..the 'differences were rwithin noise - - . r 
5 levels-. : Improvements-can be. realized by rincreasing^ the ,amount 
of overlap between- prober: ^hd hehce^^^l^ probe density and, 

for duplex DNA targets, using a second set of probes, either 
on the same or a separate chip, corresponding to the second ^ 
strand of the:target. Figure 40, in sheets l and 2, shows a 
) plot of;:hbrmalized intensities across rows- 10 and* 11 -of the 
array, and-a tabulation of the mutations detected. 

.Figure 41 shows the discrimination between wild-type and 
mutant hybrids obtained with this chip. The median of the six 
normalized hybridization scores for each probe was taken. The 
graph plots.^ the ratio-of the. median score to the" normalized * , - 
; hybridization score versus mean -counts. - On this graph,- a - ■ 
ratio of : l>6 and* mean counts above 50 yield no f alse^ - - 
positives; -and while it is clear that detection of some 
mutants- can be. improved, . excellent discrimination is achieved, 
considering the small- size of the array. ^ Figure; 4 2 
illustrates' how ;the identity of the base -mismatch may. 
inf luence^ the ability to discriminate mutant -and wild-type 
sequences more than , the. position-of - the mismatch within an - 
. oligonucleotide /probe,. The mismatch- position-is .expressed - as 
• . of . -probe length f rom the : 3 ' -end ^ The' base -change - is - — 
indicated, on .the graph... These results show that the^DNA chip 
increases: :the capacity of- the ^standard reverse- dot -blof 'format 
by orders:- of -magnitude^ extending .the" power -„of- that ^approach 
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many fold and tHat ' the methods of the invention "are more \ • 
efficient and easier to automate than gel-based methods of 
nucleic ' acid ' sequence and mutation analysis. 

To illustrate further these advantages,- a second chip was 
5 prepared "for analyzing a longer segment from human 

mitochondrial' DNA' (mtDNA) • ' The chip "tiles"' through 648 
nucleotides of a reference sequence comprising human H strand 
mtDNA from positions 16280 to 356, and allows analysis of each 
nucleotide ' in the reference sequence . The probes' in the array 

10 are 15 nucleotides in length, and each position in .the target 
sequence is represented by a set of . 4 probes (A, C,:G,.:T ; ; 
substitutions)-, which differed from one another at- position 7 
froin the 3**-end. The array consists of ^3 blocks of 4 x 50 
probes: "reach block scans .through 50 nucleotides of contiguous 

15 mtDNA sequencer The blocks are separated by blank rows/ The. 
'4 corner columns contain control probes ; there are a -total of 
2600' probes in a 1.28 cm x 1.28 cm square area (feature) , and , 
each area iis '256 x 197 microns, *' - 

Target RNA was prepared 'as above. The RNA was fragmented;- 

20 and' hybridized to the oligonucleotide -array, in. a •so"luti''on ' r . 

composed of 6X SSPE, 0.1% Triton X-100 for '60 minutes at ' 18 '*C.,, 
" Unhybridized- material was washed away with buf f e'r , - 'and the 
chip 'was scWnned at^ 25 micron pixel resolution^ 

Figure 43 provides a 5' to 3* sequencie listing of one ' 

25 - target corresponding to the probes on the - chip/-*' X- 'is- a 

control probe'. Positions that differ 'in the" target {•i-.-e. are 
mismatched -with the -probe at the designated' site)' are' in bold. 
Figure "44 shows the fluorescence image -produced by scanning 
-the -chip when 'hybridized' to' this "sample. • About- 95%^ 'of ' the^ - 

3 0 sequence could" be' read* correctly from only one strand" of the 

original, "duplex target nucleic acid. " Although some probes . did 
not provide excellent discrimination and .some probes- :did nbt 
appear ;to* hybridize to the target efficiently', excellent : 
results were achieved-'* The: target :sequence dif f ered .:f rom the 

35 probe set at six positions:- 4 transitions' and 2- insertion's. 
A;ll-^ 4 'transitions were detected, and specif ic probes/ could" * * 
rreadlly be Incorporated- into -the . 'array to detect-. Insertions or 
deletions ; v Figure: '4 5 Illustrates .the -detection: of*".-4' - ' 
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transitions ■ ''in" the ^ target - sequence relat ive.' to the. wild-type 
probes on. the^ 'chip , - : : \ ; ; . , ^ - ■ 

A further chip was constructed . coirtprising probes tiling 
across the entire D-loop region (1.3 kb) of mt DNA sequences 
from two humans. • The^probes were tiled in- rows of four using 
the basic tiling strategy. The probes -were . overlapping . 15 
mers having an interrogation position 7- nucleotides , from the 
3 ' end. ' The -complete group ' of ^ probes- tiled on the reference 
sequence from the first individual , designated mtl, * occupied 
the upper half of the chip. The lower half of the chip^ 
contained a similar arrangement based on a second clone, mt2 . 
The probes were synthesized in a 1.28 x 1.28 cm area, which 
contained a matrix of 115 x 120 cells. The chip contained : a 
total of - lG/488 mtDNA probes. V ^ ' ' ' v i- 

■ Six -samples of target . DNA' was extracted -form. hair, roots 
from six -individuals. " The 1.3 kb region spanning-pdsitions: - 
15935 to 667 of human mtDNA was PGR amplified, cloned in 
bacteriophage M13 ' and sequenced by conventional methods. ..The 
1.3 kb region.was reamplified from the phage clone using, 
primers/ rLr593 5-^T3 , ^ . . . • „ ; ;* / 

•5 'CTCGGAATTAACCCTCACTAAAGGAAACCTTTTTCCAAGGA: and H6 67-T7 , • - 
5 'TAATACGACTCACTATAGGGAGAGGCTAGGACCAAACCTATT- tagged/with T3 
and T7 RNA polymerase promoter sequences.. Labelled RNA was. 
generated by- in vitro transcription using T3 RNA polymerase; 
and f luoresceinated nucleotides , fragmented, ' and: hybridized , to: 
the mtDNA control - region resequeneing chip, at- room, temperature 
for 60 -min, in - 6xSSPE * + . 0 .05% . triton X-100 . Six^ washes were 
carried^ out at room temperature, using 6xSSPE + 0.005% triton 
X-100, and the chip , was read. Signal intensities^ varied.. , 
cons idefably ^over:.^ the., c dynamic range of^ the 

detection system - allowed accurate . quantitation of intensities 
over ^^'several orders of -magnitude. Even relatively low signal 
intensities ^.yielded accurate' results . ^: 

Five different clones (mtl-5) ^ were hybridized each to a 
separate " chip. . -The reference .'sequence: was also hybridized for 
comparative' :purposes. :Mean counts per probe cell were, 
determined,' -arid used by '^^automated :basecalling: software- to read 
the sequence The accuracy: of s*Gquence/:read from the - chip . is 
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summarized as follows, .-Combining the data from the five . . 
targets analyzed, the chip read a total of 6310 nucleotides. 
Of these -nucleotides in: the target sequences , 55 were 
different from the reference sequence ^ (as judged by 
5 conventional sequencing). (41 of these 55 -nucleotides were both 
detected and read correctly from the chip).. 6 of . 55 
nucleotides were detected as being . ambiguous but their 
identity: could not be read. 2 of 55 -nucleotides were detected 
as- mutations , but their, identity was miscalled. 6 of . 55 

10 nucleotides were incorrectly called as wildtype. Of the 6255 
nucleotides in the .target sequence that were identical -to the 
reference. sequence, only *36 (0.57%) were miscalled or scored 
as ambiguous;. ' 

A further chip was constructed comprising probes tiling 

15 across. a reference sequence comprising an : entire -mitochondrial 
genome;. In this chip,. a block tiling strategy -was used. :Each 
block. was designed to analyze seven nucleotides from a target 
sequence.., ; Each block consisted of four probe sets, the probe , 
sets -each having seven * probes . -A. block was laid; down, on the 

20 chip in seven columns of four probes. The upper probe -was , the 
same- in each column, this being a "probe exactly complementary 
to a -subsequence of the reference sequence. - The three other j 
probes in each: column were identical to the upper probe except 
in an: interrogation position, .which was occupied by a ■ - 

25 - different base^" in"- each of the' four probes; in thecolumn. The 
* interrogation position shifted by one position between : *. :* 
successive columns. - -Thus', except for- the- seven interrogation 
positions', one - in each of the columns of probes, all probes 
occupying a block were identical The array comprised many. 

30 such blocks, - each tiled to successive subsequences ..of -the ■ 
^ mitochondrial DNA -reference sequence. ,In ;all> ..the chip tiled 
15,569 nucleotides of- reference sequence with double /tiling at 
42 positions. 66,276 probes occupied an array of 3 04 -x 3 15 . 
cells, each.-cell having an area of 42 x: 41 .microns . - i 

35 - The- chip: was hybridized- to the .-same target sequences as 

described - for- the: D-loop region, except . that hybr idizatioh • was 
at 15°C for: 2"^ hr;-' :/The. chip: was," scanned at . 5 micron -resp^lut ion 
to give' an image:.. with approximately. 64 -.pixels per :cell-.. Flor 
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- blocks ;of probes tiling across :the D-loop; region, , a,:s,eguence- 
specific hybridization pattern was .obtained. ^ For other - ; . 
blocks/ only .background hybridization was, _ observed . . 

These .results illustrate that longer seguences -can be . 
5 read, using the .DNA .chips and methods of the ;inventipn,.: as - - 
compared „to conventional sequencing - methods, where ^ reading 
length is .limited by- the resolution of gel . electrophoresis . 
Hybridization and signal, detection require less than . an hour 
and can be readily, shortened by appropriate choice. of- buffers, 
10 temperatures;., probes,: and reagents. / . ■ — 

III. MODES OF PRACTICING THE IWENTTQN 

■ v:A."... : VLSIPS^. Technology ; , ; , / 

. . ^ / , As. noted above,, the „VLSIPS™ .technology is ^described in a 
15 number/.of, patent publications and:is preferred for . making, the - 
- . -. oligonucleotide.^ arrays of the inventions ; A .^brief ;;descriptiori . 
of .how„ this -technology, can be used to make and screen DNA 
■chips is provided ;in this Example and the accompanying' 
Figur.es. , In the VLSIPS™ method, light is shone through a .mask 
20 to activate functional (for oligonucleotides, typically an - 
-OH).r. groups protected, with a photoremovable protecting group 
on a surf ace of a solid support. After light activation, a 
nucleoside building block, itself protected with a 
photoremovable protecting group (at the 5 ' -OH) , is , coupled to 
2 5 the activated areas of the support , The process can .be 

repeated,, using, different masks or mask prientations : 
building blocks, to.- prepare yery dense arrays . of , many . 
different oligonucleotide probes.. The process is illustrated 
in: Figure 4.6;., Figure 4 7 illustrates . how - the process can be - 
30 used^ to;:.prepare ^nucleoside combinatorials'^ or - ^ .. ' 1 ^ S 

oligonucleotides synthesized, by coupling all four ^nucleosides 
to., f prm, dimers trimers. and so forth.- *; . 

New; methods f or, the. combinatorial chemical^ synthesis of 
peptide,,, polycarbamate, . and oligonucleotide arrays have 
35 recently been reported- (see Fodor et_al. , 1991 , science ; 251 : 
767-773; Cho et al., 1993, Science 261: 1303-1305; and 
Southern et al., 1992, Genomics 13: 1008-10017 ,;:,each of which 
is incorporated herein by reference). These. .^arr.ays or 
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biological chips"^ (see Fodor : et al., 1993 ^ ^Nature 364,: 555-556, 
incorporated herein by reference) , harbor specific chemical 
compounds at precise locations in a high-density,, information 
rich- format, and are a -powerful tool for the study of 
5 biological recognition processes. A particular ly . exciting 
application • of the array technology is in the field of. DNA:.- 
sequence analysis . • The hybridization -pattern .of. a' .DNA target 
to an array of -shorter oligonucleotide * probes is used to gain 
primary structure information of the DNA target. This -format . 

10 has important applications in -sequencing by. .hybridization-/;..DNA 
diagnostics and in elucidating the thermodynamic parameters 
affecting nucleic acid recognition. . 

Conventional DNA sequencing technology , is a, laborious 
procedure requiring electrophoretic size, separation of labeled 

15. DNA -fragments. An alternative approach,.- termed -Sequencing iBy 
Hybridization ~(SBH) , has been proposed (Lysov et al. ,--1988"/* 
Dokl . Akad. ' Nauk SSSR 303:1508-1511;^ Bains et al. , .1988-, J.- 
Theor . -Biol . -135 : 303-307 ; and Drmanac et al .-, 1989 , Genomics 
4 : 114-128 , incorporated herein by reference). This method 

20 uses a set ;Of short oligonucleotide probes of defined, sequence- 
to "search for complementary sequences on -a longer target - ^ 
strand of DNAI - The hybridization pattern- is used- to' - 
reconstruct the target DNA sequence. It -is- envisioned- that ; 
hybridization analysis of large numbers of probes- can be used 

25 to sequence • long ^stretches of DNA. ' In- immediate- applications - 
of this hybridization methodology, a-small number^ of- probes, 
can be used to -interrogate' local DNA. sequence. ^ ^ 
" The strategy of SBH can be illustrated by -the following 
example. ■ A 12-mer' target *DNA^ sequence, AGCCTAGCTGAA," is mixed 

30 with a complete set -of ' bctariucleotide probes Tf- only- perfect 
- complementarity -is considered., five- of the- 65 ,'-53 6^ octamer - . 
probes -TCGGATCG, CGGATCGA; GGATCGAC, GATCGACT, and ATCGACTT 
'will • hybridize to the target." Alignment of the over lapping 
sequenceJs from -the -hybridizing> probes reconstructs the ■ 

3 5 : complement of the original ■a2-mer target ; - - ; . 

^ - ^TCGGATCG . , - - - - . . - . 

^CGfGATCGA " ' " ' V" ^ ^ 

V • GGATCGAC •■■^^---^^ • •-•^ . - i ... 

4 0 GATCGACT v 
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_ . TCGGATCGAGTT _ ^ _ . ; 

Hybridization methodology can^ be carried- out by -attaching • 
target DNA;^to :a -surface. .The target is interrogated with a^ 
set; of -Oligonucleotide probes , one at a . time - (see Strezoska et 
al . , : 1991 Proc . Nati . Acad - Sci'.- USA 88:10089-10093,: and 
Drmanac et al . .1993 , • Science 260 ::1649-1652 , each of which is 
incorporated; herein by reference) . This approach can. be 
implemented with well established methods , of -iironobilization 
and hybridization detection, but involves : a large - number * of 
manipulations. For example, to probe, a sequence utilizing a 
full set of octanucleotides, tens; of -thousands of 
hybridization reactions must be performed.,. Alternatively,. SBH 
can be carried out by , attaching probes to a, surface in :an 
array, format where the .identity of the^ probes :at -each site is ; 
known . .. The target . ^DNA- is^^t^hen^ added --^o- the array--of rprobes . 
The hybridization -pattern determined in a single .experiment 
directly reveals the identity of all complementary probes. ; 

.,As noted above, ^ a preferred* method of- oligonucleotide - 
probe .array, synthesis involyes the use- of -light, to , direct ,the 
synthesis of. oligonucleotide probes in high-density, . 
miniaturized arrays ... Photolabile 5 ' -protected / 
N-acyl-deoxynucleoside phosphoramidites , surface linker 
chemistry,, and . versatile combinatorial synthesis. strategies 
have been developed for this technology. Matrices, of : = ^ 
spatially-defined- oligonucleotide probes have been generated, 
and the; ability- to use these arrays, to identify complementary 
sequences - haS' been -demonstrated by hybridizing fluorescent 
labeled,.oligonucleotides to the DNA chips- produced by.the 
methods . ; The . hybirid izat ion patter n dembnstra tes a hi gh . degree 
of base specificity and reveals the -sequence of . ; ;: . ... 
oligonucleotide, targets.* - . . 

The. basic .strategy f or light-directed oligonucleotide..- 
synthesis (1) is.,outlined in Fig^. 46.. The surface of a- solid 
support modified. with photolabile protecting. groups (X) ,is : 
illuminated through a photolithographic mask, yielding 
reactive ^hy^rqxyl -- groups ' in -.the illuminated regions . A.; -; 
3.',-0-phosphoramidite ;actiyated. deoxynucleoside . (protected at 
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the 5'-hydroxyl with a photolabile group) is then ■ presented to 
the surface and coupling occurs at sites that were exposed to 
light. Following ■ capping, and ' oxidation , the substrate is* 
rinsed and the surface illuminated through a second mask, to 
5 expose '.additional, hydroxyl groups for coupling. - -A second - 
. 5 ' -protected , ' 3 V-O-phosphoramidite act ivated deoxynucleoside 
is presented to the surface. • .The selective photodeprotectaon ' 
and coupling cycles are' repeated until "the desired set of ' • 
products is obtained. - ■ • ■:- 

10 Light ^ directed chemical synthesis- -lends- itself • to highly 

efficient .synthesis strategies which- will generate a maximum 
number of compounds in a -minimum number of chemical steps. - 
For example," the complete set of 4^ polynucleotides length* 
- n) / "or any -subset- of ' this set can be- produced in only '4 x' n 

15 chemical steps.- See Fig. 47 . " The patterns of il-lumination' 
and :the order of chemical reactants u-ltimately define the^ 
products and their locations. - Because photolithography is - 
used/- the process' -can be^ miniaturized to generate high-density 
arrays- of oligonucleotide probes. For an 'example- of the 

20 " nomenclature useful- for- -describing such arrays an array ' • ^■ 
containing all possible' octanucleotides bf - dA and dT is ' ^ - 
written as (A+T)®-'- Expansion' of - this polynomial"^ reveals ' the 
identity of al'l 256 octanucleotide probes from AAAAAAAA to 
TTTTTTTT.' ' A- DNA- array- composed of complete sets of* - ' ; 

25 dinucleotid'es- as' referred "to as having' a cdmplexitiy of "2/ "The 
array given- by -(A+T+C+G)8 is the full- 65-/536- octanucleotide 
array • of ' complexity four. * Computer-aided . methods of laying' 
down- predesigned arrays^ of probes using VLSIPS™ technology ' are 
described in commonly-assigned co-pending^ application- USSN - 

30 : 08/249-, 188-, filed May 24 , 1994 ( incorpbrcited by " reference "ih 
its entirety for all= purposes-) . ■ ^ ' . ' 

To carry out hybridization of DNA targets ^' to' the probe 
arrays , - the ■ arrays are mounted- in a • thermostatically - ' • 
. controlled hybridization chamber . • Fluorescein - labeled DNA - 

35 targets are - injected' i'ntb .the. chamber , and hybridization is '- 
allowed: to-^ proceed- for- 5'min' to- 24 hr The-^siirf ace ■ of -the - 
matrix is - scanned -in an ■ epifluorescence --microscope **(Zeiss' -^^ 
Axioscop" 20 equipped with - photon - counting electronics^* using 
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50r- 100 of '488 nm .excitation from an ..Argon ' ion ^'.laser :/ ', : 
(Spectra Physics Model "2020) . Measurements may be made with 
the target solution in contact with the probe matrix or after 
washing. Photon counts are -stored and image files .are; . . 
presented after conversion to an eight bit image format. See 
Fig-. 51. : V. ..; . ^. y ■.: \ \ \ ^ • - 

When- hybridizing a. DNA target .to an oligonucleotide- 
array,- N. = Lt-(Lp-l)- complementary hybrids are expected, where 
N'. is the. number ' of hybrids, Lt ,is the length of the /DNA. 
■ target, and Lp is the length -of the oligonucleotide probes on 
the array. . For example, for an ll-mer target hybridized to an 
octanucleotide array, N =='4. . Hybridizations with mismatches 
.at positions that are .2; to 3 residues from either end .of the 
probes will generate. detectable signals. Modifying, the abbve . 
expression for N, one arrives: at .a relationship estimating/ the 
numbers ofr detectabl^e- hybr idizations^^^^ DNA;- target of : 

length* Lt and an array of complexity C. Assuming an average 
of ' 5 positions giving signals, above? background:: 
Nd = , (1 +;5(C-1) ) [Lt-(Lp-l) 3 . . . . . / . _ 

. Arrays of. .oligonucleotides 'can. be efficiently .generated 
by light ^directed. synthesis and can be used. to determine :the . 
identity -of' DNA target sequences. Because combinatorial 
strategies; are used, the number of compounds increases 
exponentially while the number of chemical coupling cycles 
increases only. linearly. For example, synthesizing .the 
complete; set of . 4- ,(65> 536) octanucleotides will add only four 
hours.to the synthesis for the 16 additional cycles . 
Furthermore, .combinatorial, synthesis strategies can be 
implemented to generate arrays of any desired composition.. 
For ;example/V^ ghe . entire set of dodecamers {A^^} - can be 

produced in: 48 photolysis and coupling cycles . (b"^ compounds 
requires b,x n cycles), any, subset of the dodecamers 
(including 'any subset, of shorter oligonucleotides) can be- 
constructed with' the . correct lithographic mask design: in- 48. or 
fewer chemical coupling steps." In additron> the number of 
compounds in an ' array is limited only by the." density : of. : 
synthesis: sites :and .the overall array size. .^Recent ... 
experiments have demonstrated hybridization to^probes : . „ 
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synthesized , in '2 5 -/^m . sites . . At this , resolution the entire;. ' 
set of .65,536 octanucleot ides can be placed in an\array 
measuring .0 .:64 cra.sguare, and the set of 1,048,576 
dodecanucleotides requires only a 2 .56 cm array . • , - . 
5 .Genome sequencing projects will ultimately be limited -by 

DNA sequencing technologies. Current sequencing methodologies 
are highly .reliant on .complex procedures and require 
substantial manual effort. -^Sequencing by hybridization has * 
the potential -for -transforming many of the manual efforts into 

10 more effdcient' and. automated formats/ > Light-directed 

synthesis, .is an . efficient means for large scale production - of 
miniaturized . arrays, for SBH. /.The oligonucleotide arrays are 
not limited :to,, primary sequencing applications • Because 
single- base . changes cause' multiple changes in the 
.15. .hybridization pattern,., the oligonucleotide' arrays provide a 

powerful means to check the accuracy of previously elucidated 
DNA sequence/- or to scan for changes within a sequence. In 
the case of octanucleotides, a single base change, in the 
target DNA results in the loss of eight complements; and 

20 generates, eight new,- , complements.. / Matching: of ' hybridization 
patterns may be useful in resolving sequencing ' ambiguities 
from standard gel techniques ,- or for rapidly " detecting ■ DNA * 
mutational events ./ The potentially very high • information 
content of light-directed - oligonucleotide arrays will' change 

25 genetic -diagnostic"^ testing . ' Sequence comparisons of hundreds 
:to. thousands of." different genes-will be assayed simultaneously 
instead of the , current one, or few at a time' format-. Custom - 
arrays can .also be constructed to contain genetic markers for 
the .rapid' identi'f ication * of a wide : variety * of pathogenic ' 

3 0 organisms,.. > . - " : " :\:: ■ ■ \ : / 

iOligonucleotide arrays can also- be applied to study . the 
sequence specificity of RNA or' protein-DNA I interactions ' ' v. 
Experiments: can be designed. to elucidate, .specif icity rules" of 
non Wa.tson-Crick oligonucleotide structures' or. -to investigate 

35 the. ..use, of. novel synthetic nucleoside analogs* for antisense -or 
triple helix' appldcations.,' Suitably protected RNA monomers - • 
may be employed* for/- RNA synthesis. The' oligonucleotide ':a:rrays 
should * f ind:' broad application deducing \the thermodynamic and 
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k'inetic: -ru-les'> governing.' forinatlon^ and, .stability .of: ^ 
qligpnucleotide - cbinplexesl. , _ , . , , . ; ^ 

Other than the use of photoremovable protecting groups, 
the nucleoside coupling chemistry is very similar to; ;that used 
routinely today for oligonucleotide .synthesis . Fig. .48 shows, 
the' deprotection, coupling, and oxidation steps of a solid 
phase DNA synthesis method.. Fig. 49 .shows an illustrative 
synthesis route for the nucleoside building blocks used in the 
method. Fig. 50 shows a preferred photoremovable protecting 
group, MeNPOC; and hpw to prepare the group in active form. . 
The procedures described below show how to prepare these . 
reagents. The nucleoside building blocks are 
5 • -MeNPOC-THYMIDINE-3 ' -OCEP ; 5 • -MeNPOC-N'^-t-BUTYL 
'PHENpXYACETYL-DEOXYGYTIDINE-3 •-OCEP; 5 ' -MeNPOC-N^-t-BUTYL 
15 PHENOXYACETYL-DEOXYGUANOSINE-;3 ' -OCEP; and 5 ' -MeNPOC-N^-t-BUTYL 
PHENGXYAeETYL-pEOXYADENOSTNE-3 » -OCEP .^^ 

1. Preparation of 4 , 5-methvl enedioxv-2-nitroacetophenone 



10 



20 




HNO3 




- A solution of . 50 g :*(0 .305 mole); 3 , 4-methylenedioxy- 
: acetophenbne (Aldrich) in 200 -mL glacial acetic acid was added 
dropwise over 30 minutes to 700 mL of cold (2-4*^0) 70% .HNO3 
^.25^^^^ with stirring '{NOTE: the react ion wxl 1- overheat .withbut ^ ^ 

external cooling from an ice bath, which can be dangerous, and 
lead to side products). At temperatures below. O.'^C,. however, 
the reaction can be sluggish. A temperature of 3-5«c seems to 
be optimal). The mixture was left stirring for another 60 
30 minutes at 3-5°C, and then allowed to approach ambient 

temperature. Analysis by TLC (25% EtOAc in hexane) indicated 
■ complete conversion of the starting material within 1-2 hr. 
When the reaction was complete, the mixture was poured into "3 
liters of crushed ice, and the resulting yellow solid was 
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filtered off, ..washed with: water and: then ; suet ion-dried Yield 
"53 g (84%), used without further purification, j-— ' _ . 

2. ' Preparation of 1- r 4 : 5-Methvlenedioxv-2-nitrophenvl) 
5 ' ethanbl ■ ■ - ' ^ ^- - . ^ , . : ^ ; : : . . 




Sodium borohydride (lOg; 0.27 mol) was added slowly to a cold, 

10. ..stirring suspension of 53g J0.25 mol) of - . ... 

. 4 , 5-methylenedioxy-2-nitroacetophenone in 400 mL methanol. 
The temperature was kept below 10**C by slow addition 'of the 
NaBH4 and^ external cooling with an ice bath. Stirring was. 
continued at ambient temperature for " another [ two hours , at v^v 

15 which time TLC (CH2CI2) indicated complete conversion of the 
ketone. The mixture was poured into one liter of- ice-water 
and the resulting suspension was neutralized with ' ammonium 
chloride and then extracted ' three times with. 400 mL CH2Ci2 or " 
EtOAc (the product can be collected by' filtration and washed - 

20 at this point, but it is somewhat soluble in water and ; this 
results in a yield of only "60%). The combined organic 
extracts were washed .with brine ,': then dried, with NgSO^^^and 
levaporated.' The crude product was purified from, the main-. -: 
byproduct by dissolving, .it. in a minimum volume of CH2CI2 ; or 

25 THF('175 ml) and then precipitating „it by slowly- adding hexane^ 
(10.00 ml), while -Stirring (yield 51g;. 8 0% overall) Jt can -.; 
also be recrystallized .(e .g. , toluene-hexane)- but; this .... 
reduces the yield.. . : - , . . • , , . : : 
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1^ Preparation .of :i"r4, 5-' Tnethv-lenedioxv"2-nit:rophenvl-) ethyl 

chlorof orinate IMeNPoc-Cl ) : ; ; _ . V : . . , . , 



15 



20 



NO. OH 




NO5 O CI 



COCI9 



Toluene/THF 




Phosgene (500 mL of 20% w/v in toluene from Fluka: 965 nunole; 
4 eq.) was added slowly to a cold, stirring. solution- of 50g 
(237 nunole; 1 eg. ) of 1- (4 , 5-inethylenedioxy-2-nitrophenyl) 
ethanol in 400 mL dryTHF: The solution was stirred overnight 

temperature at which point TLC (20%- EtjO/hexane) ' 
indicated >95% conversion. The mixture was evaporated (an ' 
oil-less pump with downstream aqueous NaOH trap is recommended 
to remove the excess phosgene) to afford a viscous brown oil. 
Purification was- effected by flash chromatography on • a short 
(9 x 13 cm) column of silica gel eluted with 20% EtjO/hexane. 
Typically -SSg (85%) of the solid yellow MeNPOC-Cl is obtained 
by this procedure. The crude material has also been 
recrystallized in 2-3 crops from l : i ether/hexane . On this 
scale, -100ml is used for the first crop, .with a few percent 
THE. added to aid dissolution, and then cooling overnight at' 
-20<»C (this procedure has not been optimized) . - The product 
should- be stored desiccated at -^20«'C. 
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. 4 . Synthesis of 5 ' - Menpoc-2 ' -deoxynucleoside-3 ' - 
fN . N-diisopropvl 2*cvanoethvl phosphor am id ites . : 
(a . ) 5 ' -MeNPOC-Nucleosides 



Base ^ ^ Base 

^ >^ MenpocCI ''""^^^^ 



PCT/US94/12305 





Pyridine " 

HO '.HO 



Base= Thymidine (T) ; n-4-isobutyryl 2 ' -DEOXYcytidine (ibu-dC); 
N-2-PHENOXYAGETYL .2 'DEOXYGUANOSINE (PAC-dG) ; and. 
10 N-6-PHENOXYACETYL 2 VDEOXYADENOSINE (PAC-dA), 

All-four, of the 5 ' -MeNPOC . nucleosides were prepared from the 
base-protected 2 V-deoxynucleosides by the following procedure. - 
The protected . 2 ' -depxynucleoside (90 ininole) was dried by - 

15 . cp-evaporating twice with-250 mL anhydrous pyridine. The : 

. nucleoside ; was ; then dissolved in 300 itiL anhydrous pyridine (or., 
1 : 1, pyridine/DMF, for. the dG^^^ nucleoside) under argon and 
cooled to. *2*C in an dee bath.; A solution of 24. 6g (90 
mmple) MeNPOC-Cl in ^ 100 . mL dry THE was then added with 

20 stirring over 30 minutes. The ice bath was removed, and the 

solution allowed .to . stir . overnight at room temperature (TLC: v: 
5-10%.MeOH in CH2CI2; two diastereomers) ^ After evaporating, 
the solvents- under vacuum, the crude material .was .taken up in 
250 mL ethyl acetate .and extracted with saturated aqueous * . 

25 NaHC03 and brine. The organic phase was then dried ^over . 

Na2S04^ filtered and evaporated to obtain a yellow foam. The 
crude products were finally purified by flash chromatography 
(9 X 30 cm silica gel column eluted with a stepped gradient of 
2% - 6% MeOH in CH2CI2) - Yields of the purified diastereomer ic 

30 mixtures are in the range of 65-75%. 
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(h. ) ' 5 y -■:-Menpoc-2 ' - deoyvnucleoside-3 ' - rN , N-di i sopropvl -. 
2— cvanoethvl • phosphoraTnidites^ • 



Menpocp 




HO 



Base 



■ . ■ Menpocd 
■Amiditing reagent* 



DiEA/DCM 



Ck^ Bas€ 




OCHjCHgCN 



10 



15 



2 0 



The four deoxynucleosides were phosphitylated using either 2- 
cyanoethyl- N- diisopropyl chlorophosphoramidite , or 2- 
cyanoethyl- N,N,N' ,N'- tetraisopropylphosphorodiamidite . the 

.^^•^-^^^^^"^..^^.^ P^.?.^^^^^^- .4 ml; 55 

mmole) of 2- cyanoethyl- N,N,N\N'- tetraisopropylphosphoro- 
diamidite to a solution of 50 mmole 5'- MeNPOC-nucleoside and 
4-3g (25 ininole) diisopropylaimoniuin tetrazolide in 250 mL dry 
^,^2 ^.^2 .^^^^^ argon at ambient temperature. Continue stirring 
for 4-16 hours (reaction monitored by TLC: 45:45:l6 ' ' 

hexane/CH2Cl2/Et3Nj : Wash - the organic phase with saWrated ' 
agueous 'NaHCO^ and "brine, then dry over Na^SO^, and evaporate 
to dryness. Purify the crude amidite by flash ; chromatography 
(9 X 25 cm silica gel column eluted with hexane/GH2Cl2/TEA - 
.45r45:l0 for A, T;. or 0:90:10 for G) . ' The yield of 
purified amidite is about 90%. ^ ' ' 



2 5 



30 



B 



PREPARATION OF L ABELED" DNA /HYBRTDI ZATION TO.. ARRAy I 



...^CR amplification reactions are typically conducted in a 
mixture composed of, per reaction: I '^xl' genomic DNA; 10 ]il 
each primer (10 pmol//xl stocks) ; lo ^1 10 x PGR buff er (lOO mM 
'^^A^/^^L./P^S. 5/500' mM KGl, 15 rm MgCl2) { 10 '^1 2' mM dNTPs ' ' 
f^of .^IPO dNTP>tocks) ; ^2 . 5 U Tag ^polymerase (Perkin 
Elmer Amplil^g™/ 5 and to" 100 ^1, The' cy^^^ 

conditions are usually 40 cycles (94°C 45 sec, 55*»C 30 sec, 
72*»C 60 sec) but may need to be varied considerably from 
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sample _type to sample type -These conditions are fpr 0.2 mL 
thin wall tubes in a Perkin Elmer 9600 thermocycler . See 
Perkin Elmer 1992/93 catalogue for 9600 cycle rime 
information. Target, primer length and sequence composition, 
5 among other factors may ..also affect parameters. 

For products in the 200 to 1000 bp size range, check 2 ^1 
of the reaction on a 1.5% 0,5x TBE agarose gel using an 
appropriate ..size standard (phiX174 cut with Haelll is 
convenient) . The PGR reaction should yield several picomoles 

10 of product. It is helpful to include a negative control 

(i.e., 1 Ml instead of genomic DNA) to check for possible 
contamination. To avoid contamination, keep PGR products from 
previous experiments away from later reactions, using filter 
tips as appropriate. Using a set of working solutions and ' ' 

15 storing master solutions separately is helpful, so long as one 
does not contaminate the master stock solutions. 

For' simple amplifications of short fragments from genomic' 
DNA it is, in general, unnecessary to optimize Mg^* 
concentrations. A good procedure is the following: make a 

20 master mix minus enzyme; dispense the genomic DNA samples to 
individual tubes or reaction wells; add enzyme to the master 
mix; and mix and dispense the master solution to each well, 
using a new filter tip each time. 

25 2 . ^ PURIFIGATION ' 

Removal of unincorporated nucleotides and primers from ' 
PGR samples can be accomplished using the Promega Magic PGR 
Preps DNA purification kit. One can purify the whole sample, 
following the instructions supplied with the kit (proceed from 

30 section'IIIB, 'Sample preparation for direct purification from 
PGR reactions'). After elution of the PGR product in 50 ^1 of 
TE or H2O, one centrifuges the eluate tor 20' sec at 12,000 rpm 
in a microfuge and carefully transfers 45' '^1 to a new 
' microfuge tube, avoiding any visible pellet. Resin is * 

35 sometimes carried over during the elution step. This transfer 
prevents accidental contamination of the linear amplification 
reaction with 'Magic PGR* resin. Other methods, e'.g:, size' 
exclusion chromatography, may also be used. 
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' . " ' 3 . Linear \amplif i cat: ion • _ - ■ , - ^ . ... . 

In a 0.2 itiL thin-wall PGR tube mix: 4j/xl purified PGR 
product; 2 ^1 primer (10 pmol/^1) ; 4 ^1 10 x PGR buffer; 4 ^1 
dNTPs (2 r mM dA,^ dC,; dG, 0 . 1; mM dT) ; 4 ./xl 0.1 mM dUTR; 1 ^il 1. 
5 mM fluorescein dUTP (Amersham RPN 2121).; 1 U Tag polymerase 
(Perkin Elmer , 5 U/Atl).;: and add H20 to 40 , ^1 . . ^Conduct 40 
cycles.. (92?CO0 secy 55^G 30 sec, .72 ''G. 90 sec) of PGR.. .-^These 
conditions have been, used to .amplify a 300 nucleotide 
mitochondrial DNA fragment but are applicable .to other 
10 fragments. Even in the absence of, a., visible product band on 
an agarose gel, there .should still be enough product to. give 
an easily detectable hybridization signal... If one :is not. 
treating the DNA with uracil DNA glycosylase (see; Section 4), 
dUTP can be omitted from the reaction. ; ^ . 
15 ■ ■ _ ..... . .V ... . 

^ . ^^-^^ 4 . Fracrmentat ion . , / . . _ . . : 

Purify the linear amplification product using the Promega 
Magic PGR Preps DNA purification kit , as per Section 2 above. 
In a 0.2 . mL thin-wall - PCR^ tube; mix: 4 0> /il purified .labeled 
20 DNA; , .4 /xl; 10 X PGR .buffer; and .0 . 5 /id- uracil DNA- glycosylase 

(BRL IU//1I) . Incubate the mixture 15 min. at 37.°G, then 10. min 
at 97*'G;- store at -20*G until ready to use. 

5. Hybridization, Scanning & Stripping 

25 A blank scan, of the slide in hybridization buff er* only -is 

he.lpf.ul tO:.check; that:;the. slide, is .ready^f or use. -The buffer 
is removed from the flow cell and replaced. with. 1 mL of 
(fragmented) DNA an hybridization buffer and mixed well . The 
scan is performed, in the presence of , the labeled target. Fig, 

3 0 51/illustxate ;|Ln, illuistf atiye ^ ^d system for scanning^ a 3 

DNA . chip, _ A . series of scans at 30 min intervals using a 
hybridization temperature of 25°C yields -a very clear signal-, 
usually,, in; at least 3.0 min .to . two. hours , but; it may be . - 
desirable to hybridize longer 4-^-/ overnight. Using a- laser 

35 power; of 5.0 //W- and 50 yxm: pixels, one should obtain maximum . ^ 
counts in, the range; of . hundreds to „low .thousands/pixel for -a 
new slide.. ; When f inished ,^ .the slide -can be- stripped using -50% 
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forTnamide. rinsing well in deionized' H2O,., blowing dry and 
storing' at room temperature. • .... 

C. PREPARATION- OF LABELED RNA /HYBRIDI Z ATION- TO ' ARRAY ' 

5 • 1/ Tagged primers • - . . : . . . 

The primers used to amplify the target nucleic acid - . 

should have promoter sequences if one desires to produce. RNA-. 

from the amplified .nucleic acid. ' Suitable promoter .sequences 

are shown below and include : - 

10 (1) -the T-3 promoter sequence: - ~ . ^ . : ' .. ' ' 

5 • -CGGAATTAACCCTCACTAAAGG --^ ^ r. 

5 ' -AATTAACCCTCACTAAAGGGAG ; ■ ' ■; 

(2) the T7 promoter sequence: • - . • . ./'J^- 

5* TAATACGACTCACTATAGGGAG ; - ' ' - * 
15 and (3) the SP6 promoter sequence: 

5, ATTTAGGTGACACTATAGAA .* ■a . ; ,: . , ^ . V ! , 

The* desired promoter sequence- is added to the 5'' end of the - 
PGR primer • It is convenient to add a 'different promoter to- tr 
20 each- primer- of a -PGR primer pair so that either • strand may be 
• transcribed from a' single PGR product. ' : ' . . : : 

Synthesize PGR- primers so as to leave the DMT- group* on. - 
DMT-on purification is unnecessary for PGR but appears to be 
important for transcription.. Add 25 jil •0.5M NaOH to .v 
25 collection' vial prior to collection of origdnucleiotide to keep 
the DMT group on. Deprotect' .using standard- chemistry --^ 55^G 
overnight ds convenient.- - / • ■ " "- ^■^•r - ^ \ . . 

^ HPLG' purif ication is- accomplished by drying down • the / 
oligonucleotides^ resuspending - in *l'mL 0.1- M- TEAA. '-(dilute 2;o 
3 0 M' "stock in 'deionized water/ filter 'through 0'.^2 'micron^ filter) 
■ and filter ^through 0.-2 micron filter-.-*" Load- • 0*.5- mL- oh- reverse 
phase HPLG (column can be ^a Hamilton' PRP-1 semi-prep," #79426) . 
The gradient is ' 0' -> 50% CH3GN over- 25 min (program • 0 . 2 - • 
^mol. prep. 0-50',' 25 min) . Pool- the desired- fractions, dry- down, 
35 resuspend- in- 200 ^1' 80%- HAc. 30 min- RT. ' "Add- 200-^1'- EtOH; dry 
down.' "Resuspend in" 2 0 0^/x-l H20-,*' plus 2 0" >xr NaAc pH5, 5 ,. 600 -/il 
EtOHV" Leave 10- -min ^on- ice ;* ^centri-f uge- 12.,-:000 .rpm-. f or: .10- min^ 
in microfuge. Pour off supernatant. Rinse pellet with 1 mL 
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EtOH , .dry) .resuspend in 200 ^1 H20 ..-. Dry., resuspend in,. 200 -^1 
TE. Measure .A^2 6 0 / prepare a ip pniol/pl solution in_TE X^r^^^^^ 
. Tris.Cl pH 8.0,.,0. 1- mM EDTA) . , Following HPLC purification of 
.a .4 2 mer, a, ;yield ^in- the vicinity -of 15 nmol from, a 0.2 ^mol 
5 scale synthesis is typical. ... 

; / ■ ' 2 Genomic DNA- Preparation - ... 

-Add ,500 7x1 , (10, mM Tris.Cl pHS.O, 10 mM EDTA, ^ 100 mM 
NaGl,. 2% . (w/v) -SDS,. 4 0.iriM DTT filter sterilized to^- th^e - . 

10 " sample, ,Add , 1 . 25 . jil 2 0 mg/ml proteinase. K (Boehringer) 
Incubate at SS^C. for 2. hours,, vortexing .once or twice. 
Perform 2x '0..5 mL .1:1 phenol :.CHCl3 extractions. After each 
extraction, centrifuge ;12 ,000 rpm ;5 min in a microfuge and - 
recover 0 . 4 mL supernatant *. . Add ,3 5 ^1 NaAc ^ pH5 . 2 : plus 1 mL 

15 EtOH, Place , sample ..pn ice 4 5 min; then centrifuge 12 , 00 0 . rpm. 
3 0. .min.,,. rinse . air dry - 3 O^^^min , arid resuspend . in MOO ^1 ^TE ; 

■3. PGR \ . :. ^ , / ■ 

J is, performed in a mixture containing, per reaction: 

20 1. Ml* .genomic.. DNA;, ( 10 'pmol/^l-, stocks) ; '-4 ^1 ■ 

10 X, PGR .buffer .,(100 mM Tris.Cl pH8 , S, .500 mM KGl/ 15 mM r ^ 
MgGl^) ; .4 ^1 2 mM dNTPs (made, from 100 mM dNTP stocks) ; l U ^ 
Tag polymerase .(Perkin Elmer/ 5 U.//xl) ; to 40 ^1-. About 40 

cycles (94-C 30 sec,, 55°C 30 sec, 72 0G 30 sec) . are performed , 
25 but cycling , conditions may need to be- varied. These conditions 
are for 0 . 2, mL ,thin wall tubes, in Perkin .Elmer ,9600 . For - • 
products 4n the.. 200- to^ 1000 bp size range, . check 2 jxl of the 
reaction., on a 1.. 5%', 0 . SxTBE . agarose gel using -an appropriate ^ 
size standard.-. For. larger or smaller volumes .( 20.. - 100^.^1) , 
3 0 one; can: use^^^t^^^ but ad just the - - - 

other /ingredients accordingly. . , . i - , . 

- Iny vitro transcription - . . . , 

■ Mix.:.: 3 >il. PGR product; A 5x buffer;. 2 ./il - DTT ; ■ 2 . 4 r^l 
35 10 mM rNTPs (100 mM solutions f rom Pharmacia)^;. 0 ,.48- 7x1 .10 ^mM 
f luorescein-UTP (Fluorescein-12-UTP, 10 mM solution, from 
Boehringer Mannheim); 0,5 fil RNA polymerase (Promega T3 or T7 
RNA polymerase);, and add to 20 /xl. Incubate at 37°c for 3 
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h- '■Check 2-^1 of the - react ion on 0 .-SxTBE' agarose gel 

using- a 'size standard: 5x buffer is 200 mM Tris pH 7.5, 30 mM 
MgCl2,^ 10 ■ mM- spermidine, 50 mM NaCl, and 100 mM DTT- (supplied 
with enzyme). The PGR -product needs no- purif icat ion • and can 
5 be added directly to the transcription • mixture ; A 20 /xl - - • 
reaction is suggested for an initial test experiment and 
hybridization; a 100 ^1 reaction is considered "preparative" 
scale (the reaction can' be scaled up to obtain more target). 
The amount'of PGR product to- add is variable; typically a PGR 

10 reaction will yield several picomoles of DNA.- v If the PGR 

reaction does not produce that much target, then one should ^ • 
increase the amount "of DNA added to the transcription reaction 
(as well as optimize the -PGR); The ratio - of fluorescein-UTP ■ 
to UTP suggested' above is 1:5,- but ratios from 1:3 to 1:10 - 

15 all work well. " One can -also • label with biotin-UTP and detect 
with streptavidin-FITG ^to obtain similar results as with 
f luorescein-UTP detection. 

For nondenaturing agarose gel electrophoresis -of RNA, 
note that the -RNA band will normally migrate' -somewhat ' faster f 

20 than the ' DNA -template- band, -although - sometimes the two bands 
will cdmigrate-. The temperature of the gel can ef f ect' the \L 
migration of the RNA band. The' RNA produced from in vitro -'H 
transcription is quite stable and can be stored for months (at- 
least)* at -20*'C without any evidence of degradation. It can^^v 

25 be ■stored in unsterilized 6xSSPE 0.1% triton X-ioo at -2*0°G * 
for days (at least) ^and reused twice (at least) for ' "^-^ 
hybridization, -without taking any special' precautions in' ' 
preparation or during use'. RNase contamination- should* of 
course -be avoided. 'When ^extracting RNA^ from' cellsv it ' is 

30 preferable to- work very rapidly and to' use^ 'strongly ' denaturing 
conditions. Avoid using glassware previously contamjjiated ^* 
with RNases. Use of new disposable plasticware (not 
necessarily sterilized) is preferred,,' as - new plastic- tubes , 
tips-,- etc. are essentially RNase free.- Treatment with DEPC 

35 or autoclaving is typically not necessary^ ^ - - ■ 
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- " ' 5;-' Fracrmentat ion " *" ^' - . 
'* ' . Heat transcription mixture for forty min. 

The extent of fragmentation is controlled by varying' Mg^* 
concentration (30 mH is typical) temperature, and duration of 

■ 5.- heating. ^ "■■^ 

6. Hybridization; Scanning/ & Stripping ' * 
. ;/ ;a- blank scan of the slide in hybridization buffer only is 
helpful' to 'Check that the slide is ready for use. The-buffer 
is removed -from the flow cell and replaced with 1 mL* of - 
10 (hydrolysed) RNA in hybridization buffer and mixed well^- 

Incubate for 15 - 30 min at 18*^C.^ Remove the hybridization 
solution, which can be saved for subsequent experiments. 
.Rinse the flow cell 4-5 times with fresh changes of 6 x SSPE 
/ 0.1% Triton X-100, equilibrated - to 18 «Cv The rinses can be 
15 performed rapidlyy- but it is important to empty the flow cell 
- -bef ore-- each -new: rinse^ and: l iquid in -tSe^ ceir-^ ^^^^^"^ 

thoroughly. A series of scans at 30 min; intervals using a 
hybridization temperature of 25°C yields a very clear signal, 
usually in at- least 30 min to two' hours / but it may rbe ^^ - 
20 desirable-tb hybridize longer , a . e .v overnight . Using a la^ser " 
power of 50 ^^W and 50- pixels / one -should ' obtain ■maximum^ ' 
counts in the range of 'hundreds to low thousands/pixel- for a 
new slide.' V?hen finished, the- slide can be stripped using 
warm water . ' - ' 

25 These conditions are illustrative and assume a -probe - 

length^ of- -15' nucleotides . The^ stripping conditions - suggested 
are fairly severe^ ' but- some ^signal* may remain -on the slide if 
the washing is not ' stringent . - -Nevertheless , the counts ' • 
remaining after the wash should be very low in comparison to 
the :signai: an presence of target RNA . ' Xn some cases , much 
gentler stripping conditions -are effective . The -lower the 
hybridization' temperature and- the longer the duration of -- -^ 
hybridization, . the more difficult it is to strip the slide.-* 
Longer targets may. be more difficult* to' strip than -shorter- 

35 targets^-"-' ■' ^' ^ 

^ 7 •' Amplification of- Signal ' " ' - r - • „ 

A- variety ! of methods- -can be lised to- enhance- detection 'of 
labelled targets bound to a probe on the array. In one 
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embodiment, the protein MutS (from E. coli ) . or equivalent 
proteins, such as yeast MSHl, MSH2 , and MSH3 ;. mouse Rep-3, and 
Streptococcus Hex-A, is. used in conjunction with target 
hybridi^zation to detect, probe-target complex that contain 
5 mismatched base pairs. The protein, labeled directly or. 
indirectly, can, be, added to the chip during or after 
hybridization, of. target nucleic acid, .and differentially binds 
to homo- and heteroduplex nucleic acid. . A .wide variety. of., 
dyes and . other, labels can be used for similar purposes... For 
10 instance.,; the dye YOYO-l .is known .to bind .preferentially to 
nucleic acids containing sequences comprising runs, of 3 . or ^ - 
more G. residues. .„ . . . , , 

. . 8. Detection of Repeat Sequences , • , , 

15 . . In some circumstances, i .,e . , target nucleic .acids .with - 
repeated, sequences .or with high. G/C content,, very long -probes 
are sometimes.. required £or .optimal detection. In one - 
embodiment ^f or detecting specific sequences, in a target 
nucleic acid with a DNA chip,.^repeat sequences are detected as^ 
20 follows. : .The chip .comprises probes. -of length.- sufficient- to- - 

extend. .into the repeat region varying distances, from each end... 
The sample, .prior to hybridization, is -treated- with a labelled.^ 
oligonucleotide that is complementary .to a repeat region but, 
shorter than the full length of the repeat. The target 
25 nucleic is labelled with a- second,- distinct- label. After 

hybridization, the chip -is scanned for; probes that- have bound 
both the labelled target and the- labelled oligonucleotide - 
probe; the ..presence .of .such bound- probes-^ shows that- at -least^ 
two ..repeat sequences are • present ^ ... ^ ^ - ; ; 

3 0 :1 ■ - , - ■ - . , • " ; .; ■ • y - 

While the foregoing^ invention has- been described -in some 

detail for .purposes of clarity and understanding, it will be 
clear to one skilled in the -art f rom a reading of this- 
disclosure that various, changes in form and detai 1 . can be made 
35 without departing from the true scope of the invention.; . All: 
publications and patent documents- cited . in this^ application 
are. incorporated .by reference . in. their, entirety . for , all . 
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^ purposes-. to^.the same^ extent ^as if -each™^ individual publication 
V^or. patent document were s individually' denoted. - - v . 
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WHAT IS CLAIMEP- IS:- -•■ .,- .. ,- , .. , ... 
General tiling claims 

^ array of oligonucleotide probes. Immobilized on a 

solid support , the-, array comprising at- least two. sets of. 
3 oligonucleotide probes, 

^ ^^'> ^ first probe set comprising a plurality of 

probes,, each probe comprising a. segment., of- at. least three 
nucleotides exactly complementary ., to a subsequence of . the 
reference, sequence, the segment including at least one ': . 
interrogation position, complementary to a corresponding 
9, nucleotide in .the reference - sequence , .. -. 
^° - (2) a second probe. set. comprising a corresponding 

11 probe .for . each probe in ■ the- f irst ; probe set , the.-.corresponding 

12 probe in the second probe . set being identical ■ to a ...sequence 

13 comprising the corresponding.^probe. from , the first , probe, set- or 

14 . a...subseguence of - at^^east thre 

15 includes the at least one interrogation position, except that 

16 the . at least one interrogation position is occupied by - a . 

17 different nucleotide in. each of --the two .corresponding probes 
from the .first, and second probe -sets; .. - : 

. wherein the probes in the, first .probe set have at least 
two interrogation. positions respectively corresponding to each 
21 of. two. contiguous .nucleotides in -the reference .sequence. - 



18 
19 
20 



1 
2 



2 . . .An. array, .of. .oligonucleotide . probes - immobili-zed on a 
solid support, the array comprising at least four .sets of ■ ■ 
3 Oligonucleotide probes, 

^ - - • (1) - -a. first probe set comprising- a plurality of 

probes, each -probe ..comp.r.ising a segment- of at least three 
nucleotides exactly complementary -to ' a- subsequence of the - 
reference sequence, the segment including at. least one....... ]■ 

interrogation position complementary to a corresponding 

9 nucleotide, in the. reference . sequence, ■ 

1° - •• (2 ) , •. .second , . third and .-fourth probe sets , . each. • - 

11 comprising, a corresponding -probe -for each .probe in the ■ f-irst- 

12 probe set, the probes in the second, third and fourth probe 

13 • sets ..being.. identical,.to: a sequence comprising,. -the..- . 

14 corresponding probe from .the. first, probe .set or a subsequence 
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of at least three nucleotides thereof that' includes the at' 
least one interrogation position, except that the at least- one 
interrogation position is occupied by a different nucleotide 
in each of the four- corresponding probes froin the four proBe 
sets . ^ 

•-3;- ' The ■ oligonucleotide array of claiin'2,' further ' 
comprising a fifth probe set' coinprising a "corresponding probe 
for each probe in the first 'probe set , -the corresponding probe 
from the- fifth" probe setVbeing identical to a" sequence - 
comprising the corresponding probe- from the ^ first probe 'set 'or 
a subsequence of . at least- three nucleotides thereof that 
■ includes the" at least one - interrogation position, except that 
the at least one interrogation position- is deleted ' in the" 
corresponding-probe from the fifth probe -set 

-4 .---The oligonucleotide array of claim 2, further 
comprising- a sixth probe set comprising -a corresponding probe 
for ■ each probe ■ in the first probe set-, the corresponding prober 
from the sixth probe set -being "identical' to '^a' sequence" ' " ^ 
comprising the corresponding probe 'from the first probe' set or;, 
a subsequence of at least " three nucleotides thereof that ' 
includes the at least one interrogation position, except that"? 
an additional nucleotide is inserted adjacent to the at least' 
one* interrogation -position in the* corresponding probe f rom the 
first probe set. - :.:/.:.: *^ v.. . , \ 

- -' 5.-^' The array of claim 2, 'wherein the -first probe set has 
at least three- interrogation positions respectively • ' 
corresponding -to each- of three contiguous nucleotides in' a - 
reference sequence. - * . '• ' : 

6. The array of claim 2, wherein the- first probe" set has 
at least 50' interrogation positions respectively corresponding 
to -each of 50 contiguous nucleotides'- in a reference- sequence . 

7. The array- of - claim -1 or 2^ wherein* the^ first • probe~ 
set -has- ait least -100 -interrogation positions 'respectively " ' 
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3 corresponding to each - of 100 - cont iguous- ;nucleot.ides in~ a 

4 vreference sequence. - ; - . , 

1, 8, The oligonucleotide array of claim 1 or 2 , wherein 

2 the .first probe set has an interrogation position ■ 

3 corresponding. to each of at least-30% of the nucleotides in a 

4 reference sequence and the reference sequence ' comprises" at ■ 

5 least 100 nucleotides. 

1 . 9. ' The oligonucleotide array of claim : 8 , wherein ^the 

2 first . probe set comprises probes which completely span the- 

3 reference sequence, which probes, relative to the reference^ 

4 sequence,- overlap one .another an sequence. , : - ; - - * . 

1 10. The oligonucleotide array of claim 9v wherein the'-.. 

first probe se-t- has an - interrogation pos ition correspoh^ing to 
each of the nucleotides in the reference sequence; 



.11. The. oligonucleotide array of claim 10, wherein the 
probes are oligodeoxyribonucleotides . - ; , : - : 

12.. ..The oligonucleotide array of claim :1 or 2, wherein 
the array comprises, between 100 and 10,000 probes. 

.^^^ oligonucleotide array pf claim 1: .or 2 ,;: wherein 
the array comprises between 10,000 and -lOO , 000. probes - 

-.14. The. oligonucleotide array of claim 1 or 2:, wherein 
the array comprises between- a 00, 0 00 and. 10 , 0.00 , 000 .probes:. • 

15.„. The oligonucleotide array of claim, l, or 2, wherein 
the probes. are linked^-to the support- via a- spacer.; . - 



16. The, oligonucleotide array of claim l--or 2,- wherein: 
the. .segment in^^each. .probe of. the first • probe: setr that, is' ' 
exactly complementary to the subsequence of .the reference :■ 
sequence -.is 9-21 - nucleotides - • " , - r ^ -^.-^ 
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:17 .' 'The oligonucleotide'. 'array * of *claiTn 16, ' wherein the 
segment is n nucleotides long, and the subsequence is" at least 
n-2 nucleotides long. . 

18. The' oligonucleotide" array ' of ■ claim 1 - or 2 , wherein 
each probe- of- the first probe' set consists of the segment that 
is exactly complementary to . the subsequence of the -reference - 
sequence . - - ^ . 

- ■,19;. The oligonucleotide array -of claim 1 -or ' 2*/ wherein 
the probes ; in-the second, ' third and fourth 'probe sets are - ■ 
identical;to the corresponding probe from' the first probe set 
except that the at least one interrogation' positaon- is • 
occupied by a different nucleotide in each of the four 
corresponding probes from the four probe sets*.- : • ' . 

20. The 'array of claim 2, further comprising fifth, ' " 
sixth and seventh probe sets, wherein: 

- : : : . the 'segment bf each' probe in the first set 
includes at least two interrogation' positions" e^ • - *■ ' -' ' ' 
corresponding to a nucleotide in the reference sequence, 

• - the second, third and' fourth- probe sets,' each 

comprise a corresponding probe for each^' probe in the first 
probe set, the corresponding probes in the second, third and 
fourth- probe sets being identical' to a sequence ' comprising the 
corresponding" probe from the first probe- set' or a' subsequence 
of at least three nucleotides thereof that includes a first 
interrogation. position except "that ' the ■ first" interrogation 
position is occupied' by a dif f erent nucleoti'de - in- each of ' the' 
four corresponding probes from the four probe sets; 

■'■'* • - ' ^ the*- fifth, ■ sixth* and- s'eventh'probe sets ; "each 
comprising- a 'corre'spbnding probe ■■ for" each- -probe in "the '-first' ' 
probe set, the probes in. the fifth, sixth and seventh probe 
sets -being identical to a' sequence' comprising 'the • ^ - ^ '^ 
corresponding^ probe from-: the first - probe set -or- a subsequence ' 
of at' least: three 'nucleotides' thereof that -includes a ' second ' 
interrogation position, except that ■ the 'Se'cdnd ihteffogation '' 



wo 95/1 1995 ,.PCT/US94/12305 

22 position is^occupied by.-, a: different; nucleotide in-,-.each' of the 

23 four corresponding probes from the four probe sets, 

1 ... 21. The* array of claim 2, wherein each probe in, the 

2 first^.probe set further comprises a second, segment of at least 

3 three, nucleotides exactly ^complementary tQ^ a second 

4 subsequence, of. the reference sequence,- and, the probes : from the 

5 second, third and fourth probe sets comprise the corresponding.. 

6 probe from the first probe set or a subsequence thereof 

7 comprising the. first and second segments except in the at 

8 least one interrogation ppsition.. . , . , . 

1 -;■ 22;. ..^The; array ; of claim 2, further,comprising: ^ . 

2 a fifth probe. ^set; comprising at least one probe. 

3 comprising . a segment of at least seven nucleotides exactly 

4 complementary, .t o . a subsequence of ..-the ref erence sequence : 

5 except at one or two positions, the segment including at least 

6 one interrogation position corresponding to a nucleotide in 

7 the . reference sequence not at the one or two positions; 

8 - sixth, , seventh -and eighth probe sets, each. comprising a 

9 probe for each probe . in the fifth probe set, : the corresponding 

10 probes from the sixth, ^seventh & eighth, probe, sets., being 

11 identical, to a sequence comprising the corresponding . probe 

12 from the fifth probe set or a subsequence of ,at least, nine 

13 nucleotides . thereof including. the. at least, one interrpgation 

14 position- andvthe..one,. or twoL positions, except ,in. the ,at least 

15 one interrogation position which. is occupied by a ; different 

16 nucleotide in each- of the four probes. 

1 - ^ 23 . ; The ; array of claim 2 , wherein the probes ;are 

2 - arranged on th^ substrate so that the • f irst set. , of probes, is 

3 arranged in a- row across- the substra.te. in an .order reflecting 

4 the overlap between the probes and the reference sequence, and 

5 the additional' sets of probes -are arranged in columns .relative 

6 to. the, probes : in said, first set so. ,that probes with the same 

7 interrogation ' position- are in- -the same column and so ^that- .each. 

8 column^^ comprises- at:. least. ^.4 probes. «: ... . \ ■ 
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1 



" 24". ' The array'of ■ Claim 2, wherein said probes are 12 . to 
17 nucleotides in length. * 



1 



25 . 



The array -of Claim 2, wherein said -probes are 15 



2 nucleotides in length and- attached by a covalent linkage . to a 

3 site on a- 3 ' -end of said probes,, and said interrogation 

4' position' is ■ located at position 7y relative • -to the -3 * -end of 

5 said probes- ' . v , . 

1 ' ' 26, The array of- claim 2> -further comprises fifth/ 

2 sixth, seventh and eighth probe sets, - ■ -* . . ^ . .j. 

3 (1) a fifth probe set comprising a plurality of 

4 probes, each- probe comprising a segment • of - at -least three 

5 nucleotides exactly complementary to a- subsequence of a second 

6 reference sequence,' the- segment including-at least one ' . 

7 interrogation position complementary to -a corresponding' * • - 

8. nucleotide in- the reference sequence; ^ 
9 '(2)' the" sixth, seventh,- and- eighth probe sets, each- 

10 comprrsing' a corresponding probe for - each probe in the fifth 

11 probe' set; the- probes' ih' the sixth, seventh and eighth'^ probe 

12 * sets' being ^ identical to" a* sequence comprising the * ^ : . , 

13 corresponding probe from the fifth probe set or a subsequence 

14 of 'at ' least three nucleotides thereof that •-' includes the at - 

15 least one 'interrogation position, ' except that the at least- one- 

16 interrogation position- is occupied by a -different • nucleotide 

17 in'*each of the four corresponding probes- from the -'fifth , • 

18 sixthV seventh and eighth' probe sets." ' ' - 

1 • 27. The array of claim 22, wherein the first, second, 

2 third *and fourth probe sets' have probes ' of - a - f ir'st, length and 

3 the' fifth, • sixth, ^ seventh' and eight- probe sets have probes of 

4 a second" length different from the first length-; - — : ■ - * 

Tiling for wildtype and mutant reference sequences - 

1 • 28. An array of oligonucleotide- probes -immobilized on a 

2 solid' support', the- array comprising- at least one pair- of- first 

3 and second probe groups, each group'* comprising- a -f irst and^" 

4 second sets of oligonucleotide probes as defined by claim 1; 
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5 ■, . . Wherein- each probe in' th&: f^i:rst pr^ohQ^sBtf-'f^dm the 

6 first- group is exactly .compTementkry to a -subsequence of a • 
first reference sequence and each probe in the first probe set 

8 from the second group is exactly •■complementary to a • 

9 . subsequence from a second reference ■ sequence . ' 

^ ; 29. :.The . array of claim 28, wherein the second reference 

2 sequence. is a mutated f orm . of the first-reference sequence. , 



1 

. 2 



• 30. -The array .of ■ claim 28, wherein' each -grdiip further ' 
comprises third and fourth probe sets , each comprising a 
corresponding probe for each probe in the first probe set, the 
probes in the .second,, third and fourth probe. sets being - ' 
identical- >to a, .sequence comprising the corresponding probe V 
from the first probe. set or a subsequence -of at least three- - 
nucleotides-thereof that includes the: interrogation position, 

8 except that the interrogation position is occupied by a 

9 different nucleotide in each of the four corresporiding probes 
10 from the four probe sets.^ . ^ 



3 
4 
5 
6 
7 



1 

2 
3 
4 



31.:/^ iThe- array- of claim 30 that .comprises at least five 
pairs of, first and- second probe groups, wherein the probes in 
the. first probe:sets from the^ first; groups of the five pairs 
are exactly complementary . to subsequences, from* five different 



5 respective, first, reference, sequences. 



32. ..The array, of claim. 30 that comprises- at:^ least, forty 
pairs of first and second probe groups , wherein the probes in 
the first probe sets from. the first groups of the forty pairs- 

4 ; are exactly: c^^ 

5 respective, first reference . sequences ..' ) ■ v;-- : ^ 

Block ^tiling ... . ... . ..^ 

1 33. . .An array^^of oligonucleotide probes immobilized -•oh a 

solid support,. the -array.comprising at -least a group: of probes 
3 . comprising: , . . . . . , . 

a wildtype probe comprising a segment of at least three 
nucleotides exactly complementary to a subsequence of a 



2 



4 

5 
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reference- sequence the- segment having* at least first and 
second interrogation posit ions corresponding- to first and - 
second nucleotides in the reference sequence, 

a first set of three mutant probes , each identical to' a 
sequence comprising . the . wildtype. probe or a subsequence- of' at - 
least three nucleotides thereof including the first and second 
interrogation -positions, except in the first interrogation 
position, which is occupied by a different nucleotide in each ' 
of the three mutant probes and the wildtype probe; 

- -a- s^^^o^^ '^^^'t of ,:three' mutant .probes , ' each identical to a 
sequence comprising the-wildtype :probe • or a subsequence 'of * at 
least three nucleotides -thereof including the first and second 
interrogation positions,-, except . in the' second interrogation 
position, ^. which is - occupied by a different nucleotide -in each 
of^ the three mutant probes and the wildtype probe/' ' -* 

34 The- array of claim 33 wherein the segment of the - 
^wildtype - probe -comprises. 3-2 0 interrogation positions- ■ ~ 

corresponding to 3-2 0 respective nucleotides in the reference r.' 
sequence, and the array comprises 3-20 respective sets of 
three mutant, probes.,- each' of' the three probes .'identical' to a 
sequence comprising the^ wildtype probe- or- a- subsequence 
thereof , including -the 3-20 Interrogation positions, except ' 
that-one of- the, 3 -r2 0 interrogation positions is- occupied by a - 
different nucleotide in each of^- the three ' mutant' probes' and ' 
the wildtype probes, the one of the 3-20 interrogation 
positions- being- different in each :of the* 3-20"' respective sets 
of three -mutant probes , ^' - - ' - r.:..:, . a 

• 35, An array :•of^ probes immobilized- to'- a -sol id support 
comprising two groups of probes , each' group -as'- defined by 
claim 33, a first group comprising' a wildtype probe comprising 
a segment exactly complementary to a subsequence of-^'a first' " 
reference '.sequence and . a : second 'group comprising a wildtype 
probe comprising a segment exactly - complementary to ' a - - — - 
subsequence of a second reference sequence- 
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1 - ' 36. ' The array ^of-^ claimv 3 5 , cbniprising at least 10-100 

2 groups of . probers , each (comprising: a^wildtype 

3 a segment exactly complementary to a subsequence of at least 

4 10-100 respective reference sequences , 

Pooled probes ^ ^ . .. .v v 

1 37.. :A;method' of - comparing- a target sequence with a - 

2 reference :sequence/- the method comprising: 
/. -identifying variants of a reference sequence differing - 

from the reference sequence -in at least one nucleotide; - ' 

5 assigning each variant a ^designation, 
providing an array of pools of probes/- each pool - 

occupying, a-rseparate .:cell of the array, wherein each pool ^ 
8 comprises -a. probe comprising a segment exactly- complementary, 
to each ..variant .sequence assigned a particular designa'tron- 

contact inq the array-with -a -target-^sequ^^ comprising a 
variant of. the reference sequence; ' 

12 determining the relative hybridization intensities of the 

13 pools in the .array-to the target sequence; : • ^: 

14 :^ .determ_ining^J:he . target sequence from: the relative-^ ^- - -^ 

15 hybridization; intensities of- the pools. - r . r: -c..:.:. 



3 
4 



6 
7 



9 
10 
11 



1 



38-.- .The method of claim 37, wherein the variants are 
2 assigned numbers according to an error code. 



1 39. The method of claim 37 wherein each' variant is - 

2 assigned a^ :designat ion having at least one- digit and- ^t^ least 



one value^-for. :the^-"digit, .and each pool comprise- a probe 
comprising- a, segment exactly 'complementary to - each variant ■ 
sequence :assigned ..a ,particuia value: in .a particular digit r 



1 40 . ,The .method.,of claim- 39, wherein the • variants are - 

2 assigned :successive,, numbers in a numbering ^system of base m ' ' 
having n . digits, . and- the array comprises -n x (m-r)- pools of 

4 probes,; .^i . r-^">v;^ r . .^..^ 



3 
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1 41-. - The method of*- claim 40,- wherein each • pool . further 

2 comprises a probe comprising a segment exactly complementary 

3 to, the reference sequence . , - . 

Trellis tiling 

1 42. A pooled probe comprising a segment exactly 

2 complementary to- a subsequence of- a -reference sequence . except 

3 at a first interrogation position occupied by -a pooled^:- 

4 nucleotide -N, a -second interrogation position occupied by a 

5 pooled .nucleotide selected from the group zof . three consisting 

6 of (1) M or K, (2) R . or ;;Y and (3) S or W, and a third ' 

7 interrogation position occupied by a second pooled nucleotide 

8 selected from the group/, .wherein the pooled nucleotide - - • 

9 occupying -the second ^interrogation position comprises a - • 

10 nucleotide. :complementary to a .corresponding nucleotide f rom ' 

11 the reference .sequence when the , second pooled probe and ■ 

12 reference sequence are maximally aligned, and the pooled 

13 nucleotide occupying the third interrogation position 

14 comprises a nucleotide complementary to, a corresponding 

15 nucleotide from the- reference sequence' when, the" third-pooled 

16 probe and the reference . sequence are maximally aligned ■ , 

17 wherein N is A, C, G or T(U), K is G or T(U), M is A or C, R 

18 is A or G, Y is-~C.-or T(U), W is A or T(U) and S^is G or C. 



1 43. An array of oligonucleotide probes immobilized on 

2 solid . support the. array- comprising: . . 

• 3 . first , .-second -and-third cells .respectively occupied by ■ 

4 first, secondhand : third pooled probes, each :pooled probe- 

5 comprising -a -segment^ exactly, complementary "to " a -subsequence of 

6 a reference : sequence except :at a . f irst interrogation -position 

7 occupied by a pooled nucleotide N, a second interrogation 

8 position occupied by a pooled nucleotide selected -from the 

9 group of three -^consisting of (1)- M or K/ ( 2 )* ^R . or .:y- and ~ ( 3 )— S 

10 or W,, and, a third interrogation position occupied by - a - second ' 

11 pooled nucleotide selected from the group, wherein the pooled 

12 nucleotide occupying the second interrogation position 

13 comprises a nucleotide complementary to a corresponding 

14 nucleotide from the reference sequence when the pooled probe 
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15 and the reference sequence are maximally:; aligned ,-^andr. the 

16 pooled nucleotide occupying the third interrogation , position . 

17 comprises a nucleotide complementary to a corresponding 

18 nucleotide from the reference sequence when the pooled probe 

19 and ' the: reference . sequence, are ■ maximally : aligned ; i > 

20 \ ' provided that one of the three * interrogation.: 

21 positions in the each of the three pooled probes , is - aligned 

22 with the , same corresponding nucleotide in the* reference 

23 sequence, this , interrogation position being occupied;, by an N~ 

24 in one of ' the -pooled ; probes , and a; different, pooled nucleotide 

25 in each: of /the other, two pooled probes, 

26 , .wherein N is/A,.C, G or T(U) , K is , G or : T (U) , . M is A 

27 or-C, R:isrA,or„G> Y .is , C or • T (U); ,. . W is A,:or-. T(U) and S .is .G■ 
28 ' or -C, ^- - v;^ ■: - ; - : . ,^ ; ^ ^ ^ - ■ ' 

-1- - - 44 . The array- of - claim 4 3 -f urther comprising : ; -^y.r - . . . 

2 fourth and fifth -cells respectively occupied by fourth 

3 and fifth pooled probes, each pooled probe as defined by ^ 

4 claim . 43 _ - i v.-. , ^ - . 

^ ' _ wher;ein -one:.. of 'the -three interrogation position in ;the,^ 

6 second, third and fourth pooled probes - is aligned with the - , 
same corresponding nucleotide in the reference sequence , this 
interrogation position being occupied by an N-^ih one of_the 
9 pooled .probes, and a. different pooled nucleotide in -^each . of 

10 the other two pooled probes, 

11 wherein one of the three interrogation position in the 

12 third,, fourth and fifth pooled- probes .is .aligned with the sam 

13 corresponding nucleotide -^in the -reference sequence , this 

14 interrogation ;position being occupied by an. N in one of. the - 

15 pooled /probes , yand:^a : different • ppoled;nucieotide in leach of - - 

16 the other twq .pooled probes 

^ '4 5, ,;-:;The. array, of claim 44,. wherein . the, , pooled probes are 

2 identical except at the interrogation positions. . . ^ . 

^ 46. The .array of claim 44, wherein the; first, second,., r- 

2 third, ;~fourth' and fifth .pooled probes , are, exactly - 

3 complementary .to.jfiye , respective subsequences ^. of • the -.refer.ence 
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4 sequences." that- from.^ each . other by increments . of one : 

5 nucleotide. . , . - . , : ,\ - 

Bridge tiling 

1 47, An- array- of oligonucleotide probes immobilized on a 

2 solid support, the array comprising at/ least . four probes : 

3 ■ a. first probe comprising first and. second segments, : each 



4 of at least .three' nucleotides- and exactly complementary -to. 

5 first and • second/ subsequences of a reference sequences, the 

6 segments, including'^at- least one interrogation'- position ■ ' , 

7 corresponding to a nucleotide in the reference sequence; 

8 wherein . either (1) the first -and second subsequences are 

9 noncontiguous or (2)- the first and second subsequences are 

10 contiguous and the first and second segments are inverted • 

11 relative to the complement of the first and second 



12 subsequences* in^ the reference sequence; - ;' - •-'■'^ 

13 ' -second, -third and fourth probes, identical to a sequence 

14 comprising the first probe or a subsequence thereof comprising^^^ 

15 at least three nucleotides from each of the first and-second' 

16 segments, - except -in the - at least one- interrogation- position , 

17 which - differs in each of the probes; * ' 

1 - 48. ;The'-array of claim 47, wherein the first and ' second-* 

2 subsequences are separated- by one or ^ two nucleotides in the- ^ 

3 reference sequence. s. . ..>. 

Two interrogation positions (no wildtype) 

1 49. An -array of oligonucleotide probes -immobilized on - a 

2 solid supports the -array comprising • at least -a set of four- ■ 

3 probes, each of the probes -comprising a segment - of -at 'least - 7 

4 nucleotides that is exactly complementary - to - a -subsequence - ■ 

5 from a reference sequence, except that the segment may or may 

6 not -be -exactly complementary at two interrogation-positions, 

7 wherein: • ■ > - - - - - - • - . ^ • - \ i „ . ^ . . 

8 the first interrogation position is occupied by a 

9 different nucleotide in-each of the - four -probes , - ^ • / 

10 the - second interrogation 'position is -occupied ^by -a 

11 different" nucleotide * in each of the -four probes '-^ 
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12 ■ in, firsf and • second^ probes:, the- segment is exactly 

13 compl-einentary to the . subsequence , except at not more than one 

14 of the. interrogation positions, and . .. : 

15 ' in. third , and 'fourth probes, the segment is exactly 

16 complementary, to the subsequence, ^except at both of the 

17 interrogation, positions . ' " • 



1 -50. .. .An . array .of probes immobilized to a' support, the 

2 array comprising at least 100 sets of 4, probes , each set ■ as ' ' 

3 defined.;by claim 49, .the probes from the at least 100 sets 

4 comprising at least .100 respective segments, thesegmerits 

5 having at least 100 . respective first and second interrogation 

6 positions. 



Helper mutations 

^ ^ ^ >=An .array; of oligonucleotide probes" immbbriized on a 

2 solid support, the array comprising a set of probes 

3 comprising: , . ■, . . 

:,• , a first probe comprising a segment of at least 7 
nucleotides; exactly .complementary...to a subsequence of a - 
reference sequence except at one or two positions,- the- segment 
including: an interrogation position not at the- one or two 

8 positions; ^ . : . .. . •. 

9 second, third and fourth mutant probes, each identical to 
a sequence comprising. the wildtype probe or . a subsequence - 
thereof.. including. the..interrogation position and the one or 
two positions, except in the interrogation position, which" is 

13 occupied by -a different nucleotide: in each of the four probes. 



4 

5 
6 
7 



10 
11 
12 



Omissi;(6n..of' Pfijrfectly Matclieci -.Probe.I - : .'i . 

1 :52,.,. An array of oligonucleotide probes immobilized on a 

2 solid support,, the array -comprising at least two sets of 

3 oligonucleotide -probes , . ■ -, .-. ., 

4 (1) ,;, a first probe set .comprising: -a plurality of 
pr9bes, each probe, comprising a .segment exactly complementary 
to a subsequence of at least 3 nucleotides of a reference " ■ 
sequence., except-vat an. :interrogation position,-.- ■ - - ■ . 
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J . ;(2 ) . . ^a- second probe set , comprising' a corresponding 
probe for , each probe in the first probe set> the corresponding 
^'probe in the second probe set being identical to a sequence 
comprising, the. corresponding - probe, from: the/first- probe set or 
a subsequence., of at, least, three -nucleotides' thereof that.. 
includes the interrogation . position , except that the v 
interrogation position is occupied by a different nucleotide 
in each of . the two ■ corresponding probes and the. complement to 
the. reference sequence, . ..... 

.... ^ ..wherein the . probes . in the first probe, set have 'at' 

least .three .interrogation positions . respectively . corresponding 
to each, of , three , contiguous -nucleotides. :in the reference', 
sequence. 

Methods : 

53\.. ' A .method :0.f- comparing.. a target nucleic acid with a 
reference sequence comprising a. predetermined, sequence- of v.:^ 
nucleotides, the method comprising: • 

• (a) ; hybridizing; the target nucleic- acid to an array 1 
of oligonucleotide probes immobilized: on- a solid ^ support; the 
array , comprising: r - ... - - . . - . : . „ - . 

(1) - a. first probe set comprising .*a^^ plurality- of - 
probes, each probe comprising a segment of at least three 
nucleotides, exactly complementary to a. subsequence of the 
reference. .sequence, .the segment including at.:v least -one - • 
interrogation position complementary : to a - corresponding 
nucleotide in - the : reference .-sequence ^ : : ; r . * ^ ' 

. ,(2) .. -a second probe .set ; comprising ::a corresponding 
probe for each probe in the. first probe set, the corresponding 
probe in the second probe set being .identical ^to a sequence'- 
comprising the corresponding -probe from .the first probe set or 
a subsequence of at -least -three nucleotides thereof that • - - 
includes the at least one interrogation position -except ' that - 
the at least . one interrogation . position' i-s occupied by a 
different nucleotide in . each :Of ,the two. corresponding -probes • 
from- the .first and second .probe- sets;; * / /, ^ ' : : . . 

wherein, the probes- .in the: f irst' :probe'. vset< .have^ at-' 
least three interrogation positions respectively corresponding 
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24 to reach- of :at least .three ^nucleotides :in. -the reference- - • 

25 sequence, and - 

26 (b) determining which probes; relative to one v 

27 another , in the array bind .specif ically „to - the; target nucleic 

28 acid,: the; .relative specific binding of the probes indicating 

29 whether the target sequence is the same or different- from the 

30 .reference sequence. 

1 54- • The method; of - -claim -53, wherein the array f urther. ■ 

2, comprises third, and; fourth probe sets, each comprising, a, 

3 corresponding probe for each probe, in. the f irst; probe set, the 

4 probes in the second, third and fourth probe sets being 

^ 5 identical to a sequence comprising the corresponding probe 

6 from the- first probe set. or a subsequence . of at least three ' 

7 nucleotides thereof that includes, \the, at : least one ; . 

. 8 . interrogation position, .except that-the -at ^least one^ - / ■ 

. 9 interrogation position-is occupied by a different nucleotide 

10 in each of - the four corresponding- probes from the :f our probe-.; 

11' sets,/-.. ^ _ . ^ \ ^ ■. , ' * " 

1 55. The method of claim 54, wherein the - target - sequence 

2 has a substituted nucleotide relative to the reference 

3 sequence in at- least. one undetermined position and . the 

4 relative specif ic binding of the probes indicates ^the location 

5 of the position and the nucleotide occupying the . position . in ^ 

6 the targe.t ; sequence . . . , \ 

1 . 56 . The ^ method of claim^,54 , wherein-: 

2 - the hybridizing step comprises hybridizing, the 

■3 target nucle ic . acid and a second . target nuclei c ac id ; to the 

4 array; and , ^ . ^ . . . . ^ 

5 -the determining step comprises determining which.. - 
.6 probes, relative • to, one another , 4n the array bind . 

7 specifically to>.thej target-, nucleic acid or the second target^- 

8 nucleic acid,.,the;_ relative . specific, binding .of the. probes . 

9 indicating whether the^ target .sequence is t the. same or-:- 

10 different from the reference sequence and whether^ the, second. - 
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11 target sequence 'is^ the- same -or different from -the reference 



12 



sequence . 



1 '57. The method of claim 56, wherein the target sequence- 

2 has a'^label- and the second ^target sequence has a second- label 

3 different .from the label. . . ....... . . : 



1 58. The method of claim 56, wherein undetermined first 

2 and' -second proportions of' the first and- second target - 

3 sequences are hybridized to the -array- and the specific -^binding 

4 indicates* the proportions .* , • . . j ; 



3 



2 



1 



-59. . The method- of claim 54/ further comprising:- - • 
- (c) removing the target^nucleic acid from: the array; 
(d) hybridizing a^ second -target nucleic acid to the- 



4 array; • ■ : \ ' , ; ■ . • - 

5 -(e) determining which probes, relative to one another-,- in ' 

6 the- array, bind specif ically. to the second- target nucleic acid^^ " ^ 

7 the relative specific binding of the probes indicating whether 

8 the second target sequence is the same or different from the 

9 reference -sequence. \* ^ ■ . ^ /. ^ 



12 relative to one another , ' hybridize to the target- sequence , the 

13 relative, specific binding -of- the- probes- Indicating-' whether -the 

14 target sequence -is-^the same or- di-f f erent • f rom -the' second-' ' - - 

15 reference -sequence: ' . ^ ,;\ - ' . .ii- : iw.. ,:...'-\..z: 
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1 , ^1- . The method of /claiin: 60/ wherein- -the hybridizing step 

2 cdinprising hybridizing .the target sequence and a second ^target^ 
sequence to the array, and the relative specific binding , of - 

4 ; :the probes from the first - group indicates that the target is 
5 identical to the : first reference -sequence -and the relative" 
specific binding of the probes from the second group indicates 
. 7 that the -second- target sequence is identical to the second / 
8 reference sequence. 

1 .62.> The method of. claim 61, wherein :the first and second 

2 target sequences are heterozygous alleles of^ a gene\^^ 



3 



6 



1 
2 



8 
9 
10 



Comparative hybridization 

61. A method of comparing a target nucleic acid with" a 
reference sequence- comprising a, predetermined sequence" of 
3 nucleotides, the method comprising: - ry- r - ■ r - " 

^ hybridizing the reference sequence to an array 

5 of oligonucleotide probes immobilized on a solid support, the 

6 array comprising;- . - , ^ . . ^ 

(1) a first probe set comprising a-plurality of 
probes, each probe comprising a segment of at least 3 
nucleotides exactly complementary to a subsequence :of the 
reference sequence .except in at least one interrogation 

11 position; 

(2) " a second probe set comprising a corresponding 

13 probe for.- each probe .in, the first probe-' set the corresponding 

14 probe in the second probe set. being identical to a .sequence 

15 comprising the corresponding probe from the first probe set or 

16 a subsequence of - at. least three nucleotides thereof that 

17 ^ included the at lea;st one interrogation^ position, -except^ that 

18 the at. least one interrogation position is occupied by a 

19 different -nucleotide in each of the two ^corresponding probes 

20 from the first and second probe sets; and ^ ■ 
(h) determining which probes, relative to one 

another, ^n-. the . array^ bind specifically to , the- reference 
2 3 sequence; . 

hybridizing a target sequence to the array; 



21 
22 
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25 >••. . " :• (d) ' determining which probes ; relative to one 

26 another, "in the array bind specif ically ' to the target 

2 7 sequence; i 

28 r. /. wherein the relative 'specific binding of the probes 

29 to the reference and the target .sequence ' indicates whether the 

3 0 reference- sequence is the same or different from the target • 
31 sequence, , • ' : , ' 

1 64. The method of claim 63, whereih-^the reference 

2 sequence has a. first label and the second" reference sequence 

3 has a second label ' different from-'the -first label / and steps 

4 (a) and (c) are performed simultaneously. - 

HIV' Chip: - - - ^ ■ — : -.v,^ • :r - - \ 

1 . 65 ;;The;:^array of claim- 2; wherein ' the ref^ ■ 

2 is from a human immunodeficiency: virus % , ;:- „ 

1 ^ : . 66 • : vThe'-array-' of claim* 65,' wherein the reference-^ ^; 

2 sequence is from a reverse transcriptase gene of- the human" 

3 immunodeficiency.^ virus . " " : / v ? : 



2 sequence is from a ' protease gene of the- human -immunodeficiency 

3 virus. 

1 /; ' 68 ;* 'The ' array -of '.claim- 66 wherein the reference- • - 



1 



67 . ■ The array of claim ' 66 , " wherein the' ref erence- 



2 



sequence - is - ^-"full-length reverse* transcriptase * gene . • 



2 



1 




2 



1 



. " .:.70. . The'array of claim 66, wherein the- HIV gene is from 
the BRU HIV strain. * * - ■ - - ^ 



2 



1 



71 r-^ The array of claim 66 , - wherein' the-'HIV - gene ' is -from 
the SF2 HIV strain. ; •: 
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1 '-TZ.' -The array of claiin -2 S, wherein the^ -reference ^ 

2 sequence is from the coding strand of- a. reverse transcriptase 

3 gene of a human iimunodef iciency virus and the second - . . . 

4 reference sequence is froKi the noncoding strand of the reverse 

5 transcriptase gene^ - - v - - ^r. ,.. ... _ : 

1 73. The array of claim 28, wherein the first reference 

2 sequence is from a reverse- transcriptase gene of a human 

3 immunodeficiency virus and the '^second reference- sequence 

4 comprises a'^subsequence of the first^ reference sequence with a 

5 substitution of at least one nucleotide * : 

1 74. The array of claim 73, wherein the substitution ■ - 

2 confers drug resistance to a human immunodeficiency virus 

3 comprising the second reference sequence. - .-.^ ^. 

1 75. The array of claim 28, wherein the first and second 

2 reference sequences are- from' a reverse transcriptase gene from 

3 first and.second strains of a human immunodeficiency -virus , - - 

1 76. The array- of claim: 28V wherein the' first reference 

2 sequence is' from> a reverse transcriptase :gene of a human 

3 immuhbdef iciency ^ virus and the second reference sequence is 

4 from a 16S" RKA,- or DNA encoding the 16S RNA, . f rpm a pathogenic 

5 microorganism. - > - ' : r - 



1 77. The array of claim 28, wherein the first reference 

2 sequence is from a reverse transcriptase gene of a human 

3 immunodeficiency virus: and the second reference sequence is 

4 from, av protease .gene of .tiKe human iminuriodef iciency virus.- 

1 78. The method of claim 54, wherein the reference 

2 sequence- is: from- a: human . dmmunodeficiency virus'. :•' 

1 .1 ... 7.9. - The -method of - claim. 7S, wherein^ the reference r *. . 

2 sequence is from a human immunodeficiency , virus; and the target 

3 sequence is from a second human immunodeficiency virus. 
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1 80. - The -method of claim 79, .wherein the target sequence 

2 has..-a .substituted nucleotide .relative to the reference • 

3 sequence . in at least one undetermined position, and the 

4 relative, specific binding of the probes indicates the location 

5 of the position and the nucleotide occupying the position ,in 

6 the target sequence, 

1 8,1-.. The method of.:Claim 80, wherein the target, sequence 



2 has a substituted. .nucleotide, relative to the reference. 

3 sequence in. at least one* position , the substitution conferring 

4 drug resistance to the human immunodeficiency virus,, and the. 

5 relative specific binding of the probes reveals the 

6 substitution. . ; . . 



1 82. The method of claim. 78 .wherein: r • - ■ 

2 the hybridizing step comprises hybridizing the 

3 target nucleic acid and a second target nucleic acid, the 

.4 -second target- sequence being from a reyerse^ transcriptase gene. 

5 of a third human iinmunodef iciency - virus to .the array; and . 

6 the determining step comprises determining which 

7 probes, .relative to one- another , in- the array- bind;; 

8 specifically, to the target nucleic acid . or the second target -i.. 

9 nucleic, acid,- the- relative- specific binding of the, probes 

10 indicating whether , the target sequence, is ..the, same, or 

11 different from the reference sequence and whether;:the second 

12 target sequence is the same or different from the reference 

13 sequence. . . : . \. , r..; * 

1 83. 'The. method- of: claim 82,.. wherein the first:, .target ; 

2 sequence: has a, f irst .label, .and - the second .target, .sequence has 

3 . a second label different from the first label. 

1 84. The method of claim 82 ,.\ wherein .^undetermined, first . 

2 and second proportions of the first and second target 

3 sequences are .hybridized. to. the array and the .specific binding 

4 'indicates the„ proportions.. - ; , :. . ■ - ; . . 
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1 85 . ' /The array of claim 2 / wherein the ref erence sequence 

2 is from a CFTR gene- 

1 • 86. ..The .array of: claiin 85 , wherein the reference : ' 

2 sequence , is axon 10 of a. CFTR .gene , -and . said array - comprises 

3 over 1000 oligonucleotide probes, 1,0- to 18 . nucleotides in . . 

4 length. 

1 . 87.- The array: of . claim : 8 5-, . wherein said array comprises 

2 a set of probes comprising a specific nucleotide -sequence 

3 selected from the group of sequences comprising: 

4 3 •-TTTATAXTAG; 

5 ,3 TTATAGXAGA;, ; . - 

6 . 3 . TATAGTXGAA; - : ■ : % . . : - ; . . , . . . . . — : 

^ 7. ..3.V- . _ ATAGTAXAAA; ^ ^^^^^^^^^^^^^^^^^^^^^^^^ - , . . .. ^ 

8 3 V- . TAGTAGXAAC ; 

9 3»- , AGTAGAXAGC;^ ; - . ; v: . : 

10 3'- jGTAGAAXCCA; . , ^ . _ , 

11 3'- ;^ ... TAGAAAXCAC;-:and . ; . ; ... : ;v — c/ -v 

^2 3'- AGAAACXACA; wherein each set comprises 4 probes,; ^ 

13 and X is, dndividually A,; G,- C, and T for each set.. 

1 • 88. . The array; of claim 85, wherein, said ^group .o.f^ , 

2 sequences comprises:, ; ; ^ 

3 3 '-TTTATAXTAGIAAACC; . ; . ... 

4 3'- TTATAGXAGAAACCA; 

5 3'- TATAGTXGAAACCAC; - : . - 

6 3'--. -.ATAGTAXAAACCACA; _ 

"7 -31*;- : : ;TAGTAGXi^ccACAA;/ ^ : : ■ V./ . - 1 .t , :r:.i . 

8 3,'- AGTAGAXACCACAAA;, - - , - - * 

^ 2 ' : ,: . GTAGAAXCCACAAAG ; . . — . . 

10 3.r- ^ ^ , J T7VGAAAXCACAAAGG and ^. ^ . . . , . / . 

11 3'-- AGAAACXAGAAAGGA,\ wherein, each: set comprises ,4;-. 

12 probes,; and X is Individually Ay g, and; T f or ;each: set. 



1 .89-.,., The array .of ..claim: 3 2, _where in: the -forty, ;f irst 

2 reference sequences are from a CFTR gene. 
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1 90. The array of claim 89, wherein each of the forty^ . 

2 first 'reference . sequences -includes a site of a mutation and at . 

3 least one adjacent nucleotide. 

1 91. . --.The array of claim 90, wherein each of, the forty 

2 first reference sequences comprises at .least. five :contiguous 

3 nucleotides - from a CFTR gene. 

1 92. The array of claim 89, wherein at least one first 

2 reference -sequence- is . a from the coding- strand of 'the cystic 

3 fibrosis gene and at least one/ f irst reference - sequence - is 

4 from the noncoding strand of -the CFTR gene.. . . 

1 93. An array of oligonucleotide probes immobilized on a 

2 solid support, the array comprising at least a groiip^ of probes 

3 comprising: ..... .,-:r' 

, 4 a wildtype probe exactly complementary to- a subsequence 

5 of a reference sequence from, a cystic fibrosis gene/- the 

6 segment having at least five interrogation positions 

7 corresponding to five contiguous nucleotidesv in the reference"'- 

8 sequence., - - ^ -v " ^ - \ . - . . ' 

9 a first set -of three mutant probes, each identical to th4 

10 wildtype probe, except in a first of the five interrogation 

11 positions,- which -is occupied by a different- nucleotide in each 

12 of the three mutant probes and the wildtype- probe'; 

13 a second set of three mutant probes, each identical to 

14 the wildtype probe, except in a second of^ the -five- 

15 interrogation positions, which is occupied by ^a different 

.16 nucleotide in each of the three mutant probes and the wildtype- 

jr;17 probe; " ■ 

-•:il8 a third set of three mutant probes , ; each - identical to the 

"19 wildtype probe, except in a third of - the five interrogation 

20 positions, which is occupied by a different -nucleotide in each 

21 of the three mutant-, probes and. the wildtype < probe; 

22 ^ a' fourth set of ^three mutant probes,- each- identical to ' 

23 the wildtype probe, except in a fourth of the five 

24 interrogation positions, which is- occupied : by a different 
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25 nucleotide in each^ of the, threevMnutant probes and : the* wildtype 

2 6 probe ;. : - ' . _ . 

27 a fifth sef of three mutant probes, each identical to the 

28 wildtype, probe, except in a. fifth of. the five- interrogation. 

29 positions-, which is occupied by a different nucleotide - in each. 

3 0 



1 
2 
3 
4 
5 
6 
7 



Of the.-. three mutant probes, and the wildtype probe. - 

-94, The. array of claim 93 comprising first and second 
groups of probes,, each . group as defined by, claim 93 the first 
group comprising a wildtype probe exactly complementary to a . 
first reference sequence, and the second group; comprising a 
wildtype probe exactly complementary to a second reference 
sequence, wherein the second reference .sequence is a mutated- 
form of the first- reference sequence. , . , 



^ - 95. The array of claim^-9^^^ the first feference: 

2 sequence is from a CFTR gene and the second reference sequence 

3 is a mutated form of the first reference sequence. 

1 . 96.,, The method, of ...claim 56,. wherein- the target - sequence . 

2 and the second target sequence, are from heterozygous alleles 

3 of a CFTR gene, 

P53 Chip 

1 97, The array, of ..claim 2, •wherein .the reference:- sequence 

2 is . a sequence.^from ^a p53 . gene. . , 

1 98., The array of claim 2, wherein the reference sequence 

2 is from an hMLHl gene. 

1 99. The array of claim 2, wherein the. reference sequence 

2 is from, an MSH2-. gene. . 

1 100. The array of claim 28 , wherein, the reference 

2 sequence., is- from a human. P53 . gene . and the, second . reference 

3 sequence is from an hMLHl gene..- . 



101. The array of claim 100, furth 



er comprising: 
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2 ninth/' tenth, • eleventh and twelfth probe . sets,. ' ^ 

3 (1) the ninth probe set comprising a plurality of' 

4 prolDes , ' each probe comprising' a. segment of at least three 

5 nucleotides exactly complementary to a subsequence of a third 

6 reference sequence, the segment including at least one": 

7 interrogation position complementary to a corresponding 

8 nucleotide in the third reference sequence, 

9 (2) the tenth, eleventh and twelfth probe sets, 



10 each comprising a' corresponding probe, for each probe' in- the 

11 ninth probe set the , probes in the tenth,- eleventh and" twelfth 

12 probe' sets being identical to -a sequence comprising- the • 

13 corresponding probe from the ninth probe set or a subsequence 

14 of at 'least three nucleotides thereof that includes the- at ^ 

15 least one interrogation position, except that the at least one 

16 interrogation position is occupied by a different nucleotide 

17 in eabh of the four corresponding probes * from the ninth, 

18 tenth, eleventh ' and twelfth probe sets.^- - - * ' 



1 102. The array of claim 97, wherein the first probe set 

2 has - at least" 60 interrogation * positions * corresponding ' to at 60 

3 contiguous nucleotides from exon 6. ' : - . : .^^ 

1 i03. The array of claim 98, wherein the reference 

.2 sequence is exon 5 of a p5 3 gene, the probes are 17 , 

3 nucleotides long, "and the first set of probes is exactly 

4 complementary to the reference sequence;- and the ' at least one 

5 interrogation position is at position 7, relative to a 3 * -end ■ 

6 of each probe, which '3 • -end ■ is covalently attached 'to the 

7 substrate, 

Mitochondrial Chip ; ' - ' ' 

1 104.- The array of claim 2, wherein the -reference ■ 

2 sequence is from a mitochondrial genome. 

1 ^ ■ 105 .* The array of claim ^104 wherein said'" reference ' 

2 sequence is a sequence of a D-loop' region . ■ - :■ ' - 
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1 106. The array of claiiri 105, wherein D-loop region is 

2 full-length. 

1 107. , The array of ^claiirt 104, wherein said -reference 

2 sequence is at least 90% of a full-length" mitochondrial 

3 genome . „■ 

1 108. The, array of claim 104, wherein the reference 

2 sequence, is bounded by positions 16280 to 356 of the 

3 mitochondrial genome . 
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Tiled Array with Probes for the Detection 
of Point Mutations 



3 ' -ccgactacagtcg: 
2 ' -ccgactccagtcg: 
3 • -ccgactgcagtcg: 
3 • -ccgacttcagtgg; 
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Array Design for the R5S3X Point Mutation 



Wild-Type Pattern 



urt , 



n-2 



Position 
n-1 n n+1 n+2 



Base 



...cNaCgag.., 
...cNaTgagw.. 
...caNCgag... 
...caNTgag... 
...caaNgag... 




...caaTgNg^:. '^ 3 
^caaCgNg.w, ^^J ^ ^ 
-xaaTNag... ^ 
-.caaCNag... V e S 
•..caaNgag.- 



Exact complement 



Single bascrpair mismatch 



Wild-Type Sequence: 5'-AGGTCAaCgaGCAA-3' 
Mutant Sequence: 5'.AGGTCAaTgaGCAA-3' 



Fi2. 16 
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Array Design for the R5S3X Point Mutation 



Heterozygote Pattern 



Position 
n-2 n-1 n n+1 n+2 
Base |W. ^ m |W^ m |W 1^ t"| ^ i" ! 



A 
C 
G 



.cNaCgag.., 
.cNaTgag..; 
xaNCgag,.. 
.caNTgag„. 
..caaNgag... 




E 



-.caaTgNg-, 

— ~.caaCgNg._ 
7- -.caaTNag... 

— -.caiaCNag.- 

— ...caaNgag^ 



Wild-Type Sequence: 5'-AGGTCAACgaGCAA-3' 



Mutant Sequence: 



5'-aggtcaaTgagcaa-3' 



Fig. 17 
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